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PURIFIED THERMOSTABLE INORGANIC PYROPHOSPHATASE 
OBTAINABLE FROM THERMOCOCCUS LITORALIS 



CROSS-REFERENCE TO RELATED APPLICATIONS 

The present application is a continuation-in-part of U.S. Application Serial 
No. 08/329,721 filed October 25, 1994. 

FIELD OF THE INVENTION 



The present invention relates to an extremely thermostable enzyme. More 
specifically, it relates to a thermostable inorganic pyrophosphatase obtainable from 

1 5 Thermococcus litoralis, to the recombinant DNA which encodes this enzyme and to 

the production of this enzyme from the recombinant DNA. 

BACKGROUND OF THE INVENTION 

2 0 Inorganic pyrophosphatase (pyrophosphate phosphohydrolase) (PPase) is 

an enzyme that plays an important role in energy metabolism. This enzyme is 
responsible for the hydrolysis of pyrophosphate (PPi) which is formed principally 
as the product of the many biosynthetic reactions that utilize nucleoside 
triphosphates, such as DNA and RNA synthesis, coenzyme synthesis and the 

2 5 activation of amino acids and fatty acids. This enzyme is considered to maintain the 

forward direction of these reactions and is vital for maintaining viability. For a 
review of microbial inorganic pyrophosphatases, see, Lahti, R., Microbiol. Review 
(1983) 47:169-179, the disclosure of which is hereby incorporated by reference 
herein. 

30 

Inorganic pyrophosphatases occur widely in nature. The two best-studied 
PPases are those from Saccharomyces cerevisiae and Escherichia coli. See, for 
example, Cooperman, B.S., Methods EnzymoL (1982) 87:526-548, and Josse, J. 
and Wong, D.C.K. in Enzymes (1971), 3rd Ed., Vol. 4 (Boyer, P.D., ed.) pp. 

3 5 499-527, Academic Press, New York. Research (the disclosure of which is hereby 

incorporated by reference herein), has also been conducted on the isolation and 
purification of inorganic pyrophosphatases from other microorganisms. Four 
PPases have been isolated from the Archaea: Methanobacterium 
thermoautotrophicum (strain AH) in Alebeek, et al., Biochim. Biophys. Acta 
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(1994) 1206:231-239, the disclosure of which is hereby incorporated by reference 
herein; Methanothrix soehngenii in Jetten, et aL, Arch. Microbiol. (1992) 157:284- 
289, the disclosure of which is hereby incorporated by reference herein; 
Thermoplasma acidophilum in Richter and Schafer, Eur. J. Biochem. (1992) 
209:343-349, the disclosure of which is hereby incorporated by reference herein; 
and Sulfolbus acidocaldarius strain 7 in Wakagi, et al., Biochim. Biophys. Acta 
(1992) 1 120:289-296, the disclosure of which is hereby incorporated by reference 
herein, and S. acidocaldarius (DSM 639) in Meyer, et al., Arch. Biochem. 
Biophys. (1995) 319:149-156, the disclosure of which is hereby incorporated by 
reference herein. Furthermore, the 3-dimensional structure of three PPases has 
been determined by X-ray crystallography: Saccharomyces cerevisiae (Baker's 
yeast) in Arutiunian, et aL, Dokl. Akad. Nauk. SSSR (1981) 258:1481, the 
disclosure of which is hereby incorporated by reference herein; E. coli in 
Kankare, et al., Protein Engineering (1994) 7:823-830, the disclosure of which is 
hereby incorporated by reference herein; and Thermus thermophilus in Teplyokov, 
et al., Protein Science (1994) 3:1098-1 107, the disclosure of which is hereby 
incorporated by reference herein. 

Two different categories of inorganic pyrophosphatases have been 
differentiated: soluble (cytoplasmic) and membrane- bound. This differentiation is 
based not only on cellular localization, but on subunit structure. Cytoplasmic 
PPases are generally oligomeric proteins consisting of identical subunits. Enzymes 
from eubacterial and archaeal sources tend to be 19 to 23 kDa subunits and exist as 
either a tetramer or hexamer. See Alebeek, et al., supra (the disclosure of which is 
hereby incorporated by reference herein). Dissociation to dimers and trimers has 
been shown to exist in the absence of divalent cations, Icheba et al., J. Biochem. 
(1990) 108: 572-578. Eucaryotic enzymes are homodimers of 32 to 35 kDa and are 
exclusively dimers. The only known exception to these rules is the PPase from 
Methanothrix soehngenii that posses an CX2B2 subunit structure, Jetten, et al., supra 
(the disclosure of which is hereby incorporated by reference herein). 

All known PPases require the presence of divalent metal cations, with 
magnesium conferring the highest activity. One exception to this rule is the PPase 
from Bacillus subtilis in which the optimal activity occurs in the presence of Mn 2 +. 
See Tono and Romberg, J. Biol. Chem. (1967) 242:2375-2382, the disclosure of 
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which is hereby incorporated by reference herein. Kinetic studies have shown that 
the complex of Mg 2+ and PP* (MgPPi 2 -) is the true substrate and not PPj alone. 
The addition of free magnesium ions activates the enzyme by direct binding, while 
free PPj is thought to inhibit the enzyme. For details of PPase kinetics, see 
5 Rapoport, et al., Eur. J. Biochem. (1972) 26:237-246 and Lahti, R., supra (the 

disclosures of which are incorporated by reference herein). 

One interesting feature of this enzyme is its unusual thermostability; the 
inorganic pyrophosphatase appears to be one of the most thermostable enzymes of 
10 an organism. Research has shown that the thermostability of inorganic 

pyrophosphatases is increased by the presence of divalent cations. Commercially 
available PPases are heat resistant in the presence of Mg 2 + up to the following 
temperatures: Escherichia coli (80*C), Saccharomyces cerevisiae (Baker's yeast) 
(50°C) and Bacillus steaorthermophilus (80°C). Heat resistance is defined as the 

1 5 temperature at which the activity of the enzyme decreases not more than 5% within 

10 minutes. See, Schreier, E. and Hohne, W. E., FEBS Utters (1978) 90: 93- 
96, the disclosure of which is hereby incorporated by reference herein. 

The thermostability of other inorganic pyrophosphatases has also been 

2 0 studied. The two thermophilic PPases from thermophilic bacterium PS -3 and 

Bacillus Stearothermophilus were found to be thermostable to 75°C in the presence 
of Mg 2+ ; alone in Tris-HCl buffer these two thermophilic PPases were not 
intrinsically stable, with the PS-3 PPase losing activity at 40°C. See, Hachimori, et 
aL, /. Biochem. (1978) 86:121-130, the disclosure of which is hereby incorporated 

2 5 by reference herein. The PPase from Sulfolobus acidocaldarius strain 7 retains 

complete activity after incubation at 100°C for 10 minutes in the presence of MgCl2; 
see, Wakagi, et aL, supra. The PPase from Thermus thermophilus is heat killed at 
96 # C and retains 50% activity after incubation at 90*C for one hour in the presence 
of MgCl 2 ; see Kuranova, et al., Dokl. Akad. Nauk. SSSR (1987) 295:1013-1016 

3 0 and Hohne, et al., Biomed. Biochim. Acta (1988) 47:941-947, the disclosures of 

which are hereby incorporated by reference herein. The PPase from Thiobacillus 
thiooxidans retains 90% activity at 80°C and 40% activity at 100°C in the presence 
of Mg 2+ ; see, Tominga, N. and Mori, T., J. Biochem. (1977) 81:477-483, the 
disclosure of which is hereby incorporated by reference herein. The PPase from 
3 5 Thiobacillus ferrooxidans retains 8% activity after incubation at 100*C for 60 
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minutes; see, Howard, A. and Lundgren, D.G., Can. J. Biochem. (1970) 
48:1302-1307, the disclosure of which is hereby incorporated by reference herein. 

Research has been conducted on the effect of pyrophosphorolysis (the 
reverse reaction of polymerization) and the use of pyrophosphatases in DNA 
sequencing. Tabor and Richardson, J. Biol Chem. (1990) 265:8322-8328, the 
disclosure of which is hereby incorporated by reference herein, discovered that 
pyrophosphorolysis by bacteriophage T7 DNA polymerase can lead to degradation 
of specific dideoxy-nucleotide-terminated fragments on DNA sequencing gels. The 
variation in band intensities in a DNA sequencing reaction could be prevented by 
the addition of inorganic pyrophosphatase to reduce the level of inorganic 
pyrophosphate. Uniform band intensities are extremely helpful in the analysis of a 
DNA sequence, particularly with automated DNA sequencers. 

Moreover, research has been conducted on the use of inorganic 
pyrophosphatase to improve the yield of in vitro transcription reactions catalyzed 
by T7 RNA polymerase. See Cunningham and Ofengand, Biotechniques (1990) 
9:713-714, the disclosure of which is hereby incorporated by reference herein. 

Accordingly, there is a desire in the art to obtain and produce a purified, 
highly thermostable inorganic pyrophosphatase that may be used to eliminate the 
problems of pyrophosphorolysis including reactions higher than 37°C up to 100°C 
or higher, in any such process where an accumulation of pyrophosphate could be a 
problem; for example, any process where DNA polymerases are used in 
recombinant DNA technology, such as thermal cycle sequencing and primer 
extension reactions. 

SUMMARY OF THE INVENTION 

In accordance with the present invention, there is provided an extremely 
thermostable enzyme obtainable from T. litoralis which catalyzes the hydrolysis of 
inorganic pyrophosphate (PP0 to orthophosphate (Pi). The thermostable enzyme 
obtainable from T. litoralis consists of six identical subunits: the molecular mass of 
the native enzyme as estimated by gel filtration was approximately 125 kDa; the 
apparent molecular weight for a single subunit was determined to be 20 - 21 kDa by 
SDS-PAGE and 19,667 Da by the DNA sequence. The optimum activity can be 
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observed at a temperature of 72'C which is similar to the temperature (75 # C) for 
maximum activity of the T. litoralis DNA polymerase, also known as VentR® 
DNA polymerase (New England Biolabs, Inc.; Beverly, Massachusetts), which is 
disclosed in U.S. Patent No. 5,210,036 and 5,322,785, the disclosures of which 
5 are hereby incorporated by reference herein. The pH optimum of the enzyme was 

9.5 (at 25 # C) in both Tris-HCl and Glycine-NaOH buffers. The thermostable 
inorganic pyrophosphatase is highly specific for pyrophosphate (PPO and Mg 2 +. 
The K m value for inorganic pyrophosphate was estimated to be 1.0 mM. 
Furthermore, the inorganic pyrophosphatase is extremely thermostable and exhibits 
1 0 100% activity after incubation at 100*C for four hours. The T. litoralis PPase has a 

substantially greater thermal stability than the other commercially available PPases 
from Escherichia coli, Saccharomyces cerevisiae (Baker's yeast) and Bacillus 
steaorthermophilus. 

1 5 The DNA encoding for the 20 - 21 kDa thermostable inorganic 

pyrophosphatase subunit obtainable from T. litoralis has been isolated and 
provides another means to obtain the thermostable enzyme of the present invention. 



20 



BRIEF DESCRIPTION OF DRAWINGS 

Figure 1 - is a photograph of the SDS-polyacrylamide gel of Example I. 
Lane 1 shows the molecular weight markers and lanes 2,3,4 show the purified T. 
litoralis inorganic pyrophosphatase. 

2 5 Figure 2 - is the partial DNA sequence (SEQ ID NO: 1) of the T. litoralis 

PPase gene and its corresponding amino acid sequence (SEQ ID NO:2) of Example 
H.D.; the DNA sequence was obtained from a PCR-generated fragment. 
Highlighted in bold lettering are: (1) the N-terminal and C-terminal amino acid 
sequences used to prepare degenerate primers for both PCR and DNA sequencing; 

3 0 (2) an internal amino acid sequence used to prepare a third primer for DNA 

sequencing. 

Figures 3 A-3B - is an alignment of the T. litoralis amino acid sequence of 
Example II.D. with known amino acid sequences for cytoplasmic 
3 5 pyrophosphatases. Legend: The partial Thermococcus litoralis (T.li) amino acid 

sequence (SEQ ID NO:8) was compared to the alignment of Saccharomyces 
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cerevisiae (S.ce) (SEQ ID NO:3), Arabidopsis thaliana (A.th) (SEQ ID NO:4), 
Escherichia coli (E.co) (SEQ ID NO:5), thermophilic bacterium PS-3 (SEQ ID 
NO:6) and Thermoplasma acidophilwn (T.ac) (SEQ ID NO:7) from Richler, H. 
and Schiifer, G., Eur. /. Biochem. (1992) 209:351-355, the disclosure of which is 
hereby incorporated by reference herein. Identical amino acid positions are boxed; 
(•) indicates amino acids considered to be involved in catalytic action. 

Figure 4 - is a flow chart diagram illustrating the steps involved in using 
inverse polymerase chain reaction (hereinafter "IPCR") to amplify the entire 
inorganic pyrophosphatase gene from T. litoralis genomic DNA, see Example 
H.E.. The known DNA sequence (265 bp) is shaded solid black while the 
unknown flanking sequences of interest are shaded a lighter color. Arrows directly 
over or under the region of known sequence, represents oligonucleotide priming 
sites used to amplify the unknown region. 

Figures 5A-5B - is the complete DNA sequence (SEQ ID NO:9) of the T. 
litoralis PPase gene and its corresponding amino acid sequence (SEQ ID NO: 10) 
from Example III.B.. The entire gene is 531 bp which codes for a 176-amino-acid 
protein. 

Figure 6 - is a schematic map of pAII17/PPase, a plasmid that encodes for 
the extremely thermostable T. litoralis inorganic pyrophosphatase of Example 
III.C. 

Figure 7 - is a schematic map of pUC18/PPase, a plasmid that encodes for 
the extremely thermostable T. litoralis inorganic pyrophosphatase of Example 
III.D. 

Figure 8 - is a photograph of the SDS-polyacrylamide gel of Example IV, 
showing the purification steps of the recombinant T. litoralis PPase. Lane 1, crude 
extract (2 hour induction); lane 2, after heat step at 85°C for 15 minutes; lane 3, 
DEAE pool; and lane 4, prestained protein markers, broad range (New England 
Biolabs, Inc.; Beverly, Massachusetts), in order of decreasing molecular mass: 
MBP-B-galactosidase, MBP-paramyosin, glutamic dehydrogenase, aldolase, 
triosephosphate isomerase, B-lactoglobulin A, lysozyme, aprotinin. 
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Figure 9 - is a graph showing the inorganic pyrophosphatase activity of 
Example V as it varies with temperature (25 - 10CTC). Optimum temperature for the 
enzyme was approximately 75°C. 

Figure 10 - is a graph showing the optimum temperature for 
pyrophosphatase activity for both the native and recombinant T. litoralis inorganic 
pyrophosphatases of Example V. PPase activity was tested between 25°C and 
100°C. Optimum temperature is more specifically 72°C, and not 75°C as reported 
in Figure 9. 

Figure 1 1- is a graph showing the effect of varying pH on the activity of the 
T. litoralis PPase of Example VL Activity was analyzed in both Tris-HCl (pH 7.5 - 
9.5) and Glycine-NaOH (pH 9.0 - 10.5) buffers; optimum pH was 9.5 in both 
buffers, indicating that the T. litoralis PPase is an alkaline inorganic 
pyrophosphatase. 

Figure 12 - is a graph of the molecular weight estimation of T. litoralis 
PPase by gel filtration on a Superose 12 HR 10/30 (Pharmacia; Piscataway, New 
Jersey). The column was eluted with 50 mM Tris-HCl (pH 7.5) containing 100 
mM NaCl buffer at a flow rate of 0.5 ml/min. Standards employed were as follows: 
(1) ribonuclease A, 13.7 kDa; (2) chymotrypsinogen, 25 kDa; (3) ovalbumin, 43 
kDa; (4) bovine serum albumin, 67 kDa; (5) aldolase, 158 kDa; (6) catalase, 232 
kDa. 

Figures 13A-13B - is partial trace data from ml3mpl8 sequenced with 
AmpliTaq DNA polymerase (Applied BioSystems Division, Perkin-Elmer 
Corporation; Foster City, California) with and without the addition of T. litoralis 
PPase of example VIL The data shown in panel (13A-1 and 13A-2) (partial trace 
data from an ml3mpl8 template sequenced with AmpliTaq DNA polymerase 
(Applied BioSystems Division, Perkin-Elmer Corporation; Foster City, California) 
(no T. litoralis PPase)) was produced without the addition of PPase and contains a 
weak Guanine (G) peak at position 164 and a missing G peak, miscalled as 
Unknown (N), at position 310. When the same ml3mpl8 template in panel (13B-1 
and 13B-2) (partial trace data from an ml3mpl8 template sequenced with 
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AmpliTaq DNA polymerase (Applied BioSystems Division, Perkin-Elmer 
Corporation; Foster City, California) in the presence of 7. litoralis inorganic 
pyrophosphatase) is sequenced with the addition of PPase (one NEB unit, NEB = 
New England Biolabs, Inc.; Beverly, Massachusetts), the peaks are resolved. 

Figures 14A-14B • is the partial trace data from ml3mpl8 sequenced with 
AmpliTaq DNA polymerase (Applied BioSystems Division, Perkin-Elmer, Foster 
City, California) with and without the addition of recombinant T. litoralis PPase of 
Example VII. The data shown in panel (Figure 14 A) (control - ml3mpl8, 
DyePrimer chemistry (-12mer), no PPase) was produced without the addition of 
PPase and contains missing G and A peaks at position 316 and 317 respectively. 
When the same ml3mpl8 template in panel (Figure 14B) (+ PPase - m!3mpl8, 
DyePrimer chemistry (-21mer) - 2 NEB units PPase (recombinant)) is sequenced 
with the addition of PPase (2 NEB units), the peaks are resolved, here shown at 
position 310 and 311. 

Figure 15 - is an illustration of pAGR3. pAGR3 was constructed by 
inserting the EcoRl fragment containing the lacW gene from pACYC184/lacN (J.C. 
Wang, L J. Peck and K. Becherer (1983) Cold Spring Harbor Symposium 47:85- 
91, the disclosure of which is hereby incorporated by reference herein) into the 
Eagl site of pRS415 (R.W. Simons, R. Houman and N. Kleckner Gene (1987) 
53:85-96, the disclosure of which is hereby incorporated by reference herein) with 
the laclq gene in the opposite orientation to the beta-lactamase gene on pRS415. 
Prior to the ligation, the termini of both DNA fragments were converted to blunt 
ends using the Klenow fragment of DNA polymerase I. The pRS415/lacN 
construct was cleaved with EcoRI and SalU and the fragment bearing the replication 
origin ligated to a synthetic duplex (below) containing the Ptac promoter, a 
ribosome binding site, and a mulitple cloning site. This final construct was named 
pAGR3. Promoter and cloning site map (SEQ ID NO:28). Plasmid p AGR3 is an 
expression vector which includes several elements: 

1 . A synthetic tac promoter coupled to a symmetric synthetic lac 
operator sequence; 

2 . A lac ribosome binding site; 

3 . A polylinker for clotting, with the ATG within the Ncol site being 
about seven nucleotides downsteam of the ribosome binding site; 
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5. 
6. 



The replication origin from pBR322; 
Ampicillin resistance gene; and 

A four-fold copy of the ribosomal transcription terminator upstream 
of the tac promoter. These transcription terminators lower the basal 
level of transcription by reducing read-through transcription from 
upstream promoters. 



7. 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

The preferred thermostable enzyme herein is an inorganic pyrophosphatase 
obtainable from Thermococcus litoralis strain NS-C (DSM No. 5473). T. litoralis 
was isolated from a submarine thermal vent near Naples, Italy in 1985. This 
organism, T. litoralis, is an extremely thermophilic, sulfur metabolizing archaea, 
with a growth range between 55°C and 98°C, Neuner, et al., Arch. Microbiol. 
(1990) 153:205-207, the disclosure of which is hereby incorporated by reference 
herein. A sample of T. litoralis was deposited with the American Type Culture 
Collection (ATCC) on September 17, 1991 and has received Accession No. 55233. 

For recovering the native protein, T. litoralis may be grown using any 
suitable technique, such as the technique described by Belkin, et al., Arch. 
Microbiol. (1985) 142:181-186, the disclosure of which is hereby incorporated by 
reference herein. 

Strain NS-C was routinely grown in 2216 marine broth diluted to 1/2 
strength in synthetic seawater (Turks Island Salts, Merck, 1976). For some of the 
studies, an artificial seawater medium (ASW) was used, as described by Ruby, et 
al., AppL Environ. Microbiol. (1981) 42:317-324, the disclosure of which is 
hereby incorporated by reference herein. In both cases, a vitamin mix (thiamine- 
HC1, biotin and B^, 40 jug ml" 1 each), resazurin (0.8 mg H) and Pipes buffer (20 
mM, pH 7.2) were added. 

Liquid medium (10 ml, with or without sulfur or organic compounds) was 
placed in Hungate tubes and flushed for 10 minutes with N2. Na-thioglycolate was 
then injected to make a 0.02% solution, and the tubes were preincubated at 88°C. 
After approximately 1 hour, during which the resazurin turned colorless, the 
medium was inoculated (to 10 6 cells ml' 1 with an 18-24 h culture) and the tubes 
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were incubated at 88°C without shaking. Steam-autoclaved (110°C) sulfur, when 
used, was added in 100 mg amounts to 10 ml medium. 

After cell growth, one preferred method for isolation and purification of 
PPase is accomplished using the multi-step process as follows. 

First, the cells, if frozen, are thawed, suspended in a suitable buffer such as 
buffer A (10 mM Tris buffer pH 7.5, 1.0 mM EDTA, 1.0 mM beta- 
mercaptoethanol), sonicated and centrifuged. The supernatant is then passed 
through a column which has a high affinity for proteins that bind to nucleic acids 
such as Affigel blue column (Biorad; Cambridge, Massachusetts). The nucleic 
acids present in supernatant solution of T. litoralis and many of the proteins 
including a significant fraction of PPase pass through the column and are thereby 
pooled by washing the column with several column volumes of low salt buffer 
(100 mM NaCl) at pH of about 7.5. After washing, a linear gradient such as 0.1 to 
2.0 M NaCl buffer A is applied to the column; PPase activity is found in the flow 
through and wash. The Affigel blue flow through/wash (250 ml aliquot) is applied 
to a CM" Sepharose (cation exchanger) column. The column is washed and eluted 
with a linear gradient of 0.1 to 1.0 M NaCl in buffer A. All PPase is in the flow 
through/wash. The CM* Sepharose flow through pool is applied directly to a 
Heparin Sepharose (cation exchanger) column to remove any remaining cations and 
contaminating DNA. The column is washed and eluted with a linear gradient of 0.1 
to 1.0 M NaCl. Again 100% of the PPase activity is found in the flow 
through/wash. The Heparin flow through/wash is applied to a Hydroxylapatite 
column and washed with 0.1M NaCl buffer A. After washing, the enzyme is eluted 
with a linear gradient of 0 - 20% (NH4) 2 S0 4 . Two peaks contain PPase activity are 
detected; the peak containing the greatest specific activity for the enzyme is dialyzed 
against buffer A and applied to a high performance liquid chromatography column 
(HPLC) mono-Q column (anion exchanger). PPase is again eluted with a linear 
gradient such as 0.05 to 0.7M NaCl in a buffer A. The fractions having 
thermostable inorganic pyrophosphatase activity are pooled, dialyzed against 1.2 M 
(NH4)2S04 buffer A and applied to an HPLC phenyl- superose column 
(hydrophobic). PPase is again eluted with a linear gradient such as 1.2 - 0 M 
(NH4)2S04 in buffer A. The fractions with enzyme activity are dialyzed against 
0.05 M NaCl in buffer A and applied to an HPLC Q-HyperD column (Biosepra; 
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Marlborough, Massachusetts) (anion exchanger) to concentrate the enzyme. PPase 
is eluted with a linear gradient such as 0.05 to 0.7 M NaCl in buffer A. Two peaks 
contain pyrophosphatase activity with one peak achieving 95% purity at this stage. 

The apparent molecular weight of PPase obtainable from T, litoralis when 
electrophoresed on an SDS-polyacrylamide gel (SDS-PAGE) is approximately 20 - 
21 kDa when compared with protein standards of known molecular weight, such as 
trypsin inhibitor (soybean) assigned a molecular weight of 21,500 Da. It should be 
understood, however, that as a protein from an extreme thermophile, T. litoralis 
inorganic pyrophosphatase may electrophorese at an aberrant relative molecular 
weight due to failure to completely denature or other intrinsic properties. The exact 
molecular weight may be determined from the coding sequence of the T. litoralis 
inorganic pyrophosphatase gene. The apparent molecular weight of 20 - 21 kDa 
agrees with reported cytosolic PPases and suggests that the archaea enzyme is 
similar to most of the eubacterial pyrophosphatases which are composed of four to 
six identical subunits of 19 -21 kDa/ monomer. The T. litoralis PPase, under non- 
denaturing conditions, consists of six identical subunits with an approximate 
molecular weight of 125,500 Da as estimated by gel filtration chromatography 
using a Superose 12 HR 10/30 (Pharmacia; Piscataway, New Jersey) and 
comparing the elution volumes of several known calibration standards, such as 
aldolase (rabbit muscle) and bovine serum albumin assigned molecular weight 
values of 158,000 Da and 67,000 Da respectively. 

The PPase activity is measured by following the formation of inorganic 
pyrophosphate to orthophosphate. Two different methods are used to measure 
PPase activity in the present invention. The first method was adapted for rapid 
screening of multiple column fractions and uses thin layer chromatography (TLC). 
PPase is incubated at 75 0 C or 72"C for 10 to 60 minutes with 25 nmoles inorganic 
pyrophosphate in a reaction volume of 10 jil (under oil). NEB CircumVent™ 
buffer is used (10 mM KC1, 10 mM (NH4) 2 S0 4 , 20 mM Tris-HCl, 5 mM MgS0 4 , 
pH 8.8 @ 25°C). 1.5 |al of the reaction is spotted onto a TLC cellulose plate 
(Merck; Frankfurter, Germany; Product No. 5718) and developed in the following 
solvent: dioxane (134 ml), distilled water (55 ml), trichloroacetic acid (lOg) and 
concentrated NH4OH (0.5 ml). After drying the plate, it is sprayed with 1% 
ammonium molybdate followed by 1% stannous chloride in 10% HC1. PPi and Pi 
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are visualized, complete and incomplete reactions can be discriminated. One NEB 
unit is defined as the amount of enzyme that will convert 25 nmoles inorganic 
pyrophosphate to orthophosphate at 75*C in one hour under these conditions. 

The second method for assaying PPase activity is a more quantitative 
measurement of the (xmoles of orthophosphate liberated and is an adaptation of the 
method of Taussky and Schorr, /. Biol. Chem. (1953) 202:675 - 685, the 
disclosure of which is hereby incorporated by reference herein. The PPase is 
incubated at 75 °C for 10 minutes in a 100 pi reaction containing 30 mM Tris pH 
9.0, 1.5 mM MgCl2 and 1.5 mM sodium pyrophosphate. The enzyme is then 
cooled quickly to room temperature and complexed with 0.9 ml of Taussky-Schorr 
reagent (5.0 g ferrous sulfate, 10 ml 10% ammonium molybdate in 10 N sulfuric 
acid, in a final solution volume of 100 ml.) After 10 minutes, an absorbance 
reading is taken at 660 nm. The absorbance reading was compared to a standard 
curve using Sigma's (St. Louis, Missouri) phosphorous standard (#661-9). One 
standard unit will liberate 1.0 (imole of inorganic orthophosphate per minute under 
the above conditions. Optimal conditions were later determined to be 30 mM 
Glycine-NaOH, pH 9.5 at 72°C with a 1 : 1 ratio of Mg 2+ : PPj where both Mg 2+ 
and PPi were 1.5 mM. 

The PPase of the present invention is extremely thermostable. The enzyme 
was incubated at 100°C for four hours in the absence of Mg 2+ in a 50 mM Tris pH 
9.0 buffer. Aliquots were removed at 1, 2, 3, and 4 hours and assayed for PPase 
activity using the modified method of Taussky and Schorr, supra (the disclosure of 
which is hereby incorporated by reference herein). 100% of the original activity 
was detected at 75°C after the 4 hour incubation. The T. litoralis PPase experienced 
no heat inactivation under the above conditions. 

The thermostable enzyme of this invention may also be produced by 
recombinant DNA techniques, as the gene encoding this enzyme has been cloned 
from 7. litoralis genomic DNA. The complete coding sequence for the T. litoralis 
inorganic pyrophosphatase exists as a 531 bp fragment with various engineered 
restriction sites on each end. 
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The production of a recombinant form of T. litoralis PPase generally 
includes the following steps: DNA is isolated which encodes the active form of the 
PPase, either in its native form or as a fusion with other sequences, such as 
maltose-binding protein (EPO Publication No. 0 286 239, the disclosure of which 
5 is hereby incorporated by reference herein), which may or may not be cleaved away 

from the native form of the PPase and which may or may not effect 
pyrophosphatase activity. Next, the gene is operably linked to appropriate control 
sequences for expression in either prokaryotic or eukaryotic host/vector systems. 
The vector preferably encodes all functions required for transformation and 
1 0 maintenance in a suitable host, and may encode selectable markers and/or control 

sequences for T. litoralis PPase expression. Active recombinant thermostable 
inorganic pyrophosphatase can be recovered either from within host cells or from 
the culture media if the protein is secreted through the cell membrane. 

1 5 Each of the above steps can be accomplished in a number of ways. Vectors 

useful in practicing the present invention should provide varying degrees of 
controlled expression of T. litoralis PPase by providing some or all of the 
following control features: (1) promoters or sites of initiation of transcription, either 
directly adjacent to the start of the gene or as fusion proteins, (2) operators which 

2 0 could be used to turn gene expression on or off, (3) ribosome binding sites for 

improved translation, and (4) transcription or translation termination sites for 
improved stability. Appropriate vectors used in cloning and expression of T. 
litoralis PPase include, for example, phage and plasmids. Example of phage 
include lambda gtl 1 (Promega; Madison, Wisconsin), lambda Dash (Stratagene; La 

2 5 Jolla, California), lambda ZapII (Stratagene; La Jolla, California). Examples of 

plasmids include pBR322, pBluescript (Stratagene; La Jolla, California), PAGR3 
(Figure 15), pRRS (Skoglund, et aL, Gene, (1990) 88:1-5, the disclosure of 
which is hereby incorporated by reference herein), pSP73 (Promega; Madison, 
Wisconsin), pGW7 (ATCC No. 40166) and pET3A (Rosenberg, et al., Gene, 

3 0 (1987) 56:125-135, the disclosure of which is hereby incorporated by reference 

herein). 

It was found that one preferred method for cloning the T, litoralis inorganic 
pyrophosphatase involved cloning the PPase gene directly under control of the T7 
3 5 promotor system in the vector pAII17. However, it will be apparent to those skilled 
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in the art that the preferred mode for practicing the present invention can vary in 
accordance with techniques known in the art. 

Transformation and Infection 

Standard protocols exist for transformation, phage infection and cell 
culture. Sambrook, Fritsch and Maniatis, Molecular Cloning: A Laboratory 
Manual, 2nd Edition (1989), the disclosure of which is hereby incorporated by 
reference herein. Of the numerous E. coli strains which can be used for plasmid 
transformation, the preferred strains include JM101 (ATCC No. 33876), XL1 
(Stratagene; La Jolla, California), and RRI (ATCC No. 31343). E. coli strain 
SL1, ER1578 and ER1458 (Raleigh, et al., N. A, Research (1988) 16:1563-1575, 
the disclosure of which is hereby incorporated by reference herein) are among the 
strains that can be used for lambda phage, and Y1089 can be used for lambda gtl 1 
lysogeny. When preparing transient lysogens in Y1089 (Arusu, et al., 
Experimental Parasitology (1987) 64:281-289, the disclosure of which is hereby 
incorporated by reference herein), a culture is infected with lambda gtll 
recombinant phage either by a single large dose of phage or by co-culturing with a 
lytic host. The infected y 1089 cells are preferably grown at 37°C in the presence of 
the inducer IPTG resulting in build-up of recombinant protein within the lysis- 
defective host/phage system. Other preferred strains include: 

Strain Genotype Reference QT ^nStriKtign 

ER2426 F>proA+B+laqI<lD(lacZ)M15zzf NEB #974 FR79S0 
::miniTnlO(Kanni-E. coli K-12 Construction: ER2418 X ER2250 
fhuAl supE44 el 4- rfhDl ? -->Kan* Aux+. 

relAl? endAl spoTl? thi-1 
I>(mcrC-mrr)114::IS10 

ER2497 F-lDE3=l5Bam//7o DEcoRI-B NEB #975 

int::lacI::PlacUV5::T7 genel ill Construction: ER2488 X 1DE3 

Y>nin5 E. coli B flxuAl [Ion] ->i21, nonphage-producmg (i.e., 

ompT gal sulAll D(mcrC-mrr) i n r), supports growth of T7 genel , 
114:ISO10R (mcr- 
73::miniTnlO; Tet s )2 
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Reference or construction 

Construction: ER2503-->Tet s 
selection on Bochner plates. 



Strain Genotvne 

ER2504 F-lDE3=lsBamHlo T>EcoRI-B 
int::lad::PlacUV5::T7 genel ill 
D*m5 E. coli BfhuA2[lon] 
ompT gal sulAll D(mcrC- 
mrr)U4::IS10R(mcr- 
73::miniTnlO;Tet s )2 endAl 
R(zgb210::Tn!0)Tet s 



Construction of Genomic DNA Expression Library and Screening for 
Thermostable Inorganic Pyroph osphatase 



1 0 



1 5 
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The most common methods of screening for a gene of choice are (1) by 
hybridization to homologous genes from other organisms, (2) selection of activity 
by complementation of a host defect, (3) reactivity with specific antibodies, (4) 
screening for enzyme activity or (5) by hybridization to a PCR-generated DNA 
fragment from the target gene. Screening with a specific antibody was discovered 
to not be the best approach since a T. litoralis PPase antibody raised in rabbit 
cross-reacted with E. coli PPase. The latter method was found to be the best 
approach with the present invention. However, a method for screening for 
inorganic pyrophosphatase activity has been developed by Kukko, E., et al. in 
FEMS Microbiology Letters (1982), 15:309-31 1, the disclosure of which is 
hereby incorporated by reference herein. This method, the magnesium 
pyrophosphate overlay test, involves cloning the gene of choice into a mutant E. 
coli K-12 strain (RT4) that produces a low level of PPase. See Lahti, R., et aL, 7. 
Bacteriol. (1988)170:5901-5907, the disclosure of which is hereby incorporated by 
reference herein. The colonies are screened rapidly by pouring the following 
mixture on the colonies: 65 mg NaN3 dissolved in 10 ml of distilled H2O, 0.2 ml 
of Triton X-100, 2 ml of 50 mM Tris-HCl buffer, pH 8.0 and 2 ml of 50 mM 
tetrasodium pyrophosphate. 2.5 ml of 0.2 M MgCb and 20 ml of hot (100°C) 2% 
agar solution in distilled water is added immediately before the mixture was poured 
on the colonies. After the agar has solidified (about 10 minutes) the plates are 
incubated at 37°C Haloes are inspected after 2 hours and 6 hours. 



30 



7\ litoralis DNA can be used to construct genomic libraries as either random 
fragments or restriction enzyme fragments. The latter approach is preferred. 
Preferably, Eco RI partial digestion reactions are prepared from T. litoralis 
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genomic DNA using standard DNA restriction techniques such as described in 
Sambrook, Fritsch and Maniatis, supra, the disclosure of which is hereby 
incorporated by reference herein. Other restriction enzymes such as Bam HI, Nru I, 
Xba I, Nsi I, Hind III and Pst I can also be used. 

Although methods are available to screen both plasmids and phage using 
DNA probes, in accordance with the present invention, it has been found that phage 
systems tend to work better with 7. litoralis and are therefore preferred for the first 
libraries. Since it is uncertain whether T. litoralis control regions function in E. 
coli, phage vectors which supply all necessary expression control regions such as 
lambda gtl 1 and lambda Zap II, are preferred. By cloning T. litoralis DNA into the 
Eco RI site of lambda gtl 1 , T. litoralis PPase may be expressed either as a fusion 
protein with beta-galactosidase or from its own endogenous promoter. 

Once formed, the expression libraries are screened with a PCR-generated 
DNA probe using standard hybridization procedures such as described in 
Sambrook, Fritsch, and Maniatis, supra, the disclosure of which is hereby 
incorporated by reference herein. 

An alternative method is to use inverse PCR on a restriction fragment of T. 
litoralis genomic DNA encoding the gene for the inorganic pyrophosphatase. This 
method allows one to rapidly amplify the DNA sequences that flank a region of 
known sequence. See Ochman, H., et al., Genetics (1988), 120:621-623, the 
disclosure of which is hereby incorporated by reference herein. 

The T. litoralis PPase, coding for all or part of the whole gene, can then be 
subcloned in, for example, the plasmids pBR322, pRRS, pAGR3, pBluescript, or 
pUC19. The DNA sequence can be determined by, for example, the Sanger 
dideoxy chain- terminating method (Sanger, R., Nicklen, S. & Coulson, R. R., 
PNAS (1977) 74:5463-5467), the disclosure of which is hereby incorporated by 
reference herein. Subcloning into over-expression vector systems (described 
below) can thus be performed. 
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Identification of DNA Encoding and Expression of the T. litoralis 

Inorganic Pyrgphpsnhatasf 

Several methods exist for determining that the DNA sequence coding for the 
5 T. litoralis inorganic pyrophosphatase has been obtained. These include, for 

example, comparing the amino-terminal sequence of the protein produced by the 
recombinant DNA to the native protein, or determining whether the recombinant 
DNA produces a protein which binds antibody specific for native T. litoralis 
PPase. In addition, research by Cooperman, et aL, Trends Biochem. Sci. (1992) 
1 0 17:262-266, the disclosure of which is hereby incorporated by reference herein, 

suggests that the active site of soluble inorganic pyrophosphatases are conserved 
among many species. X-ray analysis of the T. thermophilics PPase shows 15 
highly conserved residues that form part of the active site. See Kuranova, et. aL, 
supra. Therefore, by comparing the predicted amino acid sequence of the cloned 

1 5 gene with the amino acid sequence of other known soluble PPases, such as E. coli, 

5. cerevisiae and T. thermophilics, the identification of these islands of homology 
provides strong evidence that the recombinant DNA indeed encodes an inorganic 
pyrophosphatase. 

2 0 Once identified, the DNA sequence coding for the T. litoralis PPase, can be 

cloned into an appropriate expression vector such as a plasmid derived from E. 
coli, for example, pET3A, pRRS, pAGR3, pBluescript or pUC19, the plasmids 
derived from the Bacillus subtilis such as pUB 110, pTP5 and pC194, plasmids 
derived from yeast such as pSH19 and pSH15, bacteriophage such as lambda 

2 5 phage, bacteria such as Agrobacterium tumefaciens, animal viruses such as 

retroviruses and insect viruses such as Baculovirus. 

Overexpression of the T. litoralis PPase can be achieved, for example, by 
separating the T. litoralis PPase gene from its endogenous control elements and 

3 0 then operably linking the pyrophosphatase gene to a very tightly controlled 

promoter such as T7 expression vector. See, Rosenberg, et aL, Gene (1987) 
56:125-135, the disclosure of which is hereby incorporated by reference herein. 
Insertion of the strong promoter may be accomplished by identifying convenient 
restriction targets near both ends of the T. litoralis PPase gene and compatible 
3 5 restriction targets on the vector near the promoter, and transferring the T. litoralis 

PPase gene into the vector in such an orientation as to be under transcriptional and 
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translation^ control of the strong promoter. Alternatively, PCR can be used to 
generate desirable sites at both ends of the T. litoralis PPase gene for cloning into 
an appropriate expression vector. Basic methods for cloning PCR-generated 
fragments include: the incorporation of restriction enzyme sites into 
deoxyoligonucleotide primers, T/A cloning, ligase-independent cloning, UDG 
cloning and blunt-ended cloning. See Costa, G.L., et al., PCR Methods and 
Applications (1994) 3:338-345, the disclosure of which is hereby incorporated by 
reference herein. The strategy of cloning the T. litoralis PPase gene into a vector 
such as pAII17 with a T7 promotor system is the preferred method. 

T. litoralis PPase may also be overexpressed by utilizing a strong ribosome 
binding site placed upstream of the T. litoralis PPase gene to increase expression 
the gene. See, Shine and Dalgarno, Proc. Natl Acad. ScL USA (1974) 71:1342- 
1346, the disclosure of which is hereby incorporated by reference herein. 

The recombinant vector is introduced into the appropriate host using 
standard techniques for transformation and phage infection. One such appropriate 
host for a PPase gene is the mutant E. coli K-12 strain (RT4), as previously 
discussed. Another such appropriate host is ER2497/pLysS and ER2504/pLysS in 
which the pLysS lowers background expression. Methods of transformation are, 
for example, the calcium chloride method, as described by Cohen, S. N., PNAS 
(1972) 69:21 10, the disclosure of which is hereby incorporated by reference 
herein, is used for E. coli . The transformation of Bacillus is carried out according 
to the method of Chang, S., et al., Molecular and General Genetics (1979) 
168: 1 1 1, the disclosure of which is hereby incorporated by reference herein. 
Transformation of yeast is carried out according to the method of Parent, et al., 
Yeast (1985) 1:83-138, the disclosure of which is hereby incorporated by 
reference herein. Certain plant cells can be transformed with Agrobacterium 
tumefaciens, according to the method describe by Shaw, C. H., et al., Gene 
(1983) 23:315, the disclosure of which is hereby incorporated by reference herein. 
Transformation of insect cells with Baculovirus is carried out according to, for 
example, the method described in Biotechnology (1988) 6:47, the disclosure of 
which is hereby incorporated by reference herein. 
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The transformants are cultivated, depending on the host cell used, using 
standard techniques appropriate to such cells. For example, for cultivating E, coli y 
cells are grown in LB media (Sambrook, supra) at 30°C to 42°C to mid-log or 
stationary phase. Cells may be induced with IPTG at mid-log or grown to 
5 saturation in the case of host cells containing a high copy plasmid, such as pUC18 

or pUC19. 

The T. litoralis PPase can be isolated and purified from a culture of 
transformed host cells, for example, by either extraction from cultured cells or the 
1 0 culture solution. 



When the T. litoralis PPase is to be extracted from a cultured cell, the cells 
are collected after cultivation by methods known in the art, for example, 
centrifugation. Then, the collected cells are suspended in an appropriate buffer 
1 5 solution and disrupted by ultrasonic treatment, lysozyme and/or freeze-thawing. A 

crude extract containing the T. litoralis PPase is obtained by centrifugation and/or 
filtration. 



When the T. litoralis PPase is secreted into the culture solution, i.e., alone 
2 0 or as a fusion protein with a secreted protein such as maltose binding protein, the 

supernatant is separated from the cells by methods known in the art. 

The separation and purification of the T. litoralis PPase contained in the 
culture supernatant or the cell extract can be performed by the method described 

2 5 above, or by appropriate combinations of known separating and purifying 

methods. These methods include, for example, methods utilizing solubility, such as 
salt precipitation; methods utilizing the difference in molecular weight, such as 
dialysis, ultra-filtration, gel-filtration, and SDS-polyacrylamide gel electrophoresis; 
methods utilizing a difference in electric charge, such as ion-exchange column 

3 0 chromatography; methods utilizing specific affinity, such as affinity 

chromatography; methods utilizing a difference in hydrophobicity, such as reverse- 
phase high performance liquid chromatography; methods utilizing a difference in 
isoelectric point, such as isoelectric focusing electrophoresis; and methods utilizing 
variations in thermostability, such as heating to denature labile enzymes. 

35 
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Stabilization and Use of the T. litoralis Inorganic Pyrophosphatase 

For long-term storage, the thermostable enzyme of the present invention 
maybe stored in the following buffer: 0.10 M KC1, 20 mM Tris (pH 8.0), 0.1 mM 
EDTA, 1 mM dithiothreitol and 50% glycerol at -20°C. 

The T. litoralis PPase of the present invention may be used for any 
purpose in which such an enzyme is necessary or desirable. For example, in 
recombinant DN A technology to eliminate the problems of pyrophosphorolysis (the 
reverse of polymerization). The accumulation of pyrophosphate could be a problem 
in any process where DNA polymerases are used, such as, second-strand cDNA 
synthesis in cDNA cloning, DNA sequencing and DNA amplification. See 
Sambrook, Frtisch and Maniatis, supra, Tabor and Richardson, supra., the 
disclosures of which are hereby incorporated by reference herein. The procedure of 
DNA amplification, commonly referred to as polymerase chain reaction (PCR), is 
disclosed in U.S. Patent Nos. 4,683,195, 4,683,202 and 4,800,159, the 
disclosures of which are hereby incorporated by reference herein. 

The following Examples are given to illustrate embodiments of the present 
invention as it is presently preferred to practice. It will be understood that the 
examples are illustrative, and that the invention is not to be considered as restricted 
except as indicated in the appended claims. 

EXAMPLE I 

Purification of a Thermostable Inorganic Pyrophosphatase From 

Thermococcus Litoralis 

Thermococcus litoralis strain NS-C (DSM No. 5473) was grown in the 
media described by Belkin, et al., supra, the disclosure of which is hereby 
incorporated by reference herein, containing 10 g/1 of elemental sulfur in a 100 liter 
fermentor at its maximal sustainable temperature of approximately 80°C for two 
days. The cells were cooled to room temperature, separated from unused sulfur by 
decanting and collected by centrifugation and stored at -70°C The yield of cells 
was 0.8 g per liter. 
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160 g of cells obtained as described above, were suspended in 550 ml of 
buffer A (10 mM Tris buffer, pH 7.5, L0 mM EDTA, 1.0 mM beta- 
mercaptoethanol) containing 0.1 M NaCl and sonicated for 5 minutes at 4°C. The 
lysate was centrifuged at 15,000 g for 30 minutes at 4°C. The supernatant solution 
5 was passed through a 470 ml Affigel blue column (Biorad; Hercules, California). 

The column was then washed with 1000 ml of buffer A containing 0.1 M NaCl. 
The column was eluted with a 2000 ml linear gradient from 0.1 to 2.0 M NaCl in 
buffer A; inorganic pyrophosphatase activity was found in the flow through and 
wash. The Affigel blue flow through/wash (250 ml aliquot) was applied to a 25 ml 
1 0 CM" Sepharose (Pharmacia; Piscataway, New Jersey) column, equilibrated with 

buffer A containing 0.1 M NaCl. The column was washed with 50 ml of buffer A 
containing 0.1 M NaCl, and the column was eluted with a 250 ml linear gradient of 
0.1 to 1.0 M NaCl in buffer A; 100% of the enzyme activity was found in the flow 
through/wash. The CM" Sepharose flow through/wash pool (300 ml) was applied 

1 5 directly to a 27 ml Heparin Sepharose (Pharmacia; Piscataway, New Jersey), 

equilibrated with buffer A containing 0.1 M NaCl. The column was washed with 
60 ml of buffer A containing 0. 1 M NaCl and eluted with a 250 ml linear gradient 
of 0.1 to 1.0 M NaCl; again 100% of the inorganic pyrophosphatase activity was 
found in the flow through/wash. The Heparin flow through/wash (360 ml) was 

2 0 applied to a 42 ml Hydroxylapatite (Biorad; Hercules, California) column, 

equilibrated in buffer A with 0. 1 M NaCl. The column was washed with 100 ml of 
buffer A containing 0.1 M NaCl. The protein was eluted with a 450 ml gradient of 
0 to 20% (NH4)2SC>4. Fractions were tested for enzyme activity. The inorganic 
pyrophosphatase activity eluted as two peaks at 8 - 10 % and 12 - 15 % 

2 5 (NH4)2SC>4 respectively; >90% of the enzyme activity applied to the column was 

found in the second peak. The fractions of the second peak (82 ml) were pooled 
and dialyzed against 4 liters of buffer A containing 0.05 ml NaCl and then applied 
to an 8 ml HPLC mono-Q HR 10/10 column (Pharmacia; Piscataway, New 
Jersey). The inorganic pyrophosphatase activity was eluted with a 160 ml linear 

3 0 gradient of 0.05 to 0.7 M NaCl in buffer A. The inorganic pyrophosphatase activity 

eluted as a single peak at 0.27 M NaCl. The pooled fractions (4 ml) were diluted to 
10 ml with buffer A, dialyzed against 1 liter of buffer A containing 1.2 M 
(NH4)2S04 and applied to an 8 ml HPLC phenyl-superose column (Pharmacia; 
Piscataway, New Jersey). The inorganic pyrophosphatase activity was eluted with 
3 5 a 160 ml linear gradient of 1.2 to 0 M (NH4)2SC>4 in buffer A. The enzyme eluted 
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as a single peak at 0.95 M (NH^SCU. The pooled fractions (12 ml) were dialyzed 
into 2 liters of buffer A containing 0.05 ml NaCl and applied to a 2.0 ml HPLC Q- 
Hyper D column (Biosepra; Marlborough, Massachusetts). The enzyme was eluted 
with a 40 ml linear gradient in buffer A from 0.05 to 0.7 M NaCl. The enzyme 
eluted as two peaks at 0.25 and 0.29 M NaCl respectively and represented 56% of 
the original starting activity. 

By SDS-polyacrylamide gel electrophoresis (SDS-PAGE) and subsequent 
staining of the proteins using a colloidal stain (ISS Problue) more sensitive than 
Coomassie Blue (Neuhoff, et ah, Electrophoresis (1988) 9:255-262, the disclosure 
of which is hereby incorporated by reference herein), it was determined that one of 
the peaks from the Q-Hyper D column was approximately 95% pure with one 
major band at 20 - 21 kDa. FIG 1 . The second Q-Hyper D peak was approximately 
50% pure with two major bands present, one at 50 kDa and the other at 20 - 21 
kDa. These molecular weight determinations were obtained by comparison on the 
same gel to the migration of the following pre-stained marker proteins (New 
England Biolabs, Inc.; Beverly, Massachusetts): MBP-B-galactosidase, 175,000 
Da; MBP-paramyosin 83,000 Da; glutamic dehydrogenase, 60,000 Da; aldolase, 
47,500 Da; triosephophate isomerase, 32,500 Da; B-lactoglobulin A; 25,000 Da, 
lysozyme, 16,500 Da; and aprotinin, 6,500 Da. At this level of purification the 
inorganic pyrophosphatase had a specific activity of 1,400,000 NEB units (or 
approximately 13,000 standard units) of pyrophosphatase activity per mg of 
pyrophosphatase protein and represented 56% of the total activity present in the 
Affigel blue flow through/wash. 

EXAMPLE II 

Cloning of T. Litoralis Inorganic Pyrophosphatase 

A . Identification of Amino Acid Sequence from the Purified T. 
litoralis Inorganic Pyrophosphatase 

The T. litoralis pyrophosphatase, prepared as described above, was 
subjected to electrophoresis and electroblotted according to the procedure of 
Matsudaira, P., J. Biol Chem. (1987) 262:10035-10038, the disclosure of which 
is hereby incorporated by reference herein, with modifications as previously 
described. See Waite-Rees, T. A., et aL, /. Bacterol. (1991) 173:5207-5219 and 
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Looney, M.C., et al., Gene (1989) 80:193-208, the disclosures of which are 
hereby incorporated by reference herein. The membrane was stained with 
Coomassie blue R-250 and the protein band of approximately 20-21 kDa was 
excised and subjected to sequential degradation (Looney, M.C., et al., supra, the 
5 disclosure of which is hereby incorporated by reference herein). The amino 

terminal sequence of the PPase was obtained using an Applied BioSystems 
Division, Perkin-Elmer Corporation (Foster City, California) 470A protein 
sequencer, Brooks, et al., Nucleic Acid Research (1989) 17:979-997, the 
disclosure of which is hereby incorporated by reference herein. The first 37 amino 
1 0 acid residues (SEQ ID NO:l 1) of the 20-21 kDa protein corresponded to: Met- 

Asn-Pro-Phe-His-Asp-Leu-Glu-FYo-Gly-PrcHGlu-Val-Pro-Glu-Val-Val-Tyr-Ala- 
Leu-Ile-Glu-Ile-Pro-Lys-Gly-Ser-Arg-Asn-Lys-Tyr-Glu-Leu-Asp-Lys-Lys-Thr . 

An additional sample of the T. litoralis inorganic pyrophosphatase, 5 jxg in 

1 5 20 was treated with 1 mg of cyanogen bromide (Sigma; St. Louis, Missouri) 

dissolved in 200 |il of 88% distilled formic acid for 24 hours in the dark at room 
temperature. This reaction mixture was evaporated to dryness and resuspended in 
100 \i\ of 5% trifluroacetic acid (TFA) in water. This sample was injected on an 
Applied BioSystems Division, Perkin-Elmer Corporation (Foster City, California) 

2 0 (ABI 172 microbore HPLC system containing a 2.1 x 250 mm, 5 {im particle 300A 

pore size CI 8 column (Vydac; Hesperia, California) equilibrated at 95% solvent A 
(water, 0.1% TFA) and 5% solvent B (acetonitrile, 0.1% TFA). The column was 
developed at flow rate of 200 ^il/min with UV detection at 214 nm for 2 minutes 
followed by a 45 minute linear gradient from 5% B to 60% B and a 10 minute 

2 5 linear gradient from 60% B to 100% B. The major peptide peaks were collected and 

subjected to sequential degradation (Looney, M.C., et al., supra). The first 18 
amino acid residues of the peptide designated C41-19 corresponded to Arg-Glu- 
Pro-Thr-Tyr-Pro-Gly-Val-Leu-Ile-Glu-Ala-Arg-Pro-Ile-Gly-Gly-Phe (SEQ ID 
NO: 12). Residues 13 and 17 were somewhat ambiguous. This fragment was 

3 0 therefore determined not to be derived from the N- terminus of the protein. 

B . Preliminary Mapping of the T. litoralis Inorganic 
Pyrophosphatase 



3 5 



Based on the N- terminus sequence, two 17-oligomers were synthesized on 
an Applied BioSystems 1 automated DNA synthesizer (Applied BioSystems 
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Division, Perkin-Elmer Corporation; Foster City, California) with the following 
sequences: 5* Bt-ATGAAYCCNTTYCAYGA 3' (probe 109-129) (SEQ ID 
NO:13); and 5* Bt-ATHGARATHCCNAARGG 3* (probe 109-130) (SEQ ID 
NO:14). (Y = C or T; N = A, C, G, or T; H = A, C or T; R = A or G). Based on 
the internal sequence from the cyanogen bromide treatment, a 23-mer was made of 
the following complementary DNA sequence: 5' Bt-AGNACNCCNGGRTANG 
TNGGYCC 3' (probe 1 10-144) (SEQ ED NO:15). The oligomer probes were end 
labeled with biotin (Bt) using a biotinylated phosphoroamidite (Applied 
BioSystems Division, Perkiri-Elmer Corporation; Foster City, California) 

The three southern blots were prepared as follows: 200 ng of T. litoralis 
genomic DNA was digested with the restriction endonucleases EcoRl, HindXR, and 
Pst I. The digests were electrophoresed on a 1 .0% agarose gel. The gel was 
washed in 0.25 M HC1 for 30 minutes, then in two changes of 0.5 M NaOH, 1.5 
M NaCl for 15 minutes each and finally in two changes of 0.5 M Tris pH 7.5, 1.5 
M NaCl for 15 minutes each. The DNA was transferred to nylon membrane, 
Immobilon™-S (Millipore; Bedford, Massachusetts), using recommended 
protocols for the MilliBlot™- VP Vacuum Pump (Millipore; Bedford, 
Massachusetts). The membrane was presoaked in 2x SSC and 20x SSC (175.3 g 
NaCl, 88.2 g sodium citrate in 1L of H2O) was used as the transfer buffer. The 
DNA transfer was allowed to proceed for approximately one hour. The membrane 
was dried for 30 minutes at 45° C and ultraviolet crosslinked at 254 nm using a 
power output of 33.00 mJ/cm 2 for 30 seconds with the X-linker-254 (Automated 
BioSystems, Inc. (Essex, Massachusetts)/Owl Scientific, Inc. (Woburn, 
Massachusetts)). 

The nylon blot was pre-hybridized in 5 ml of 6x SSC, 5x Denhardt's 
reagent (0.5 gram ficoll, 0.5 gram polyvinylpyrrolidone, 0.5 gram bovine serum 
albumin), 0.5% SDS and 100 (ag/ml denatured sonicated salmon sperm DNA. 
After prehybridizing at 42° C for one hour, the biotinylated labeled probe was added 
and the hybridization was carried out at 42°C overnight. The blot was washed with 
two changes of 2x SSC, 0. 1 % SDS at room temperature for 5 minutes each; two 
changes of Ix SSC, 0.1% SDS at 42°C for 15 minutes each and finally two 
changes of lx SSC at room temperature for 5 minutes each. The blot was 
developed using chemiluminescent detection according to the standard protocol in 
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NEBlot™ Phototope Detection Kit (New England Biolabs, Inc.; Beverly, 
Massachusetts). The blots were exposed to X-ray film (XAR5, Kodak; Rochester, 
New York) for 20 minutes. 



10 



Results of autoradiography of the above blots revealed high degeneracy of 
probes to be a possible source of difficulty in specifically cloning this PPase. 

C . Using PCR to Amplify Part of the Inorganic Pyrophosphatase 
Gene from T. litoralis Genomic DNA 



Based on the determined amino acid sequence of the protein (SEQ ID 
NO:l 1), new primers were designed for PCR. In order to reduce the primer 
degeneracy, deoxyinosine was substituted in all places where the third position of 
the amino acid was completely ambiguous (A, C, G or T). The primer for the N- 

1 5 terminal was as follows: 5* CTGAATTCCATATGAAYCCNTTYCAYGA 3' 

(primer 1 15-70) (28-mer) (SEQ ID NO: 16); Eco RI and Nde I recognition sites 
were added to the 5* terminus to facilitate cloning; incorporation of one 
deoxyinosine reduced the degeneracy from 32-fold to 8-fold. The sequence of the 
internal primer was: 5' AGNACNCCNGGRTANGTNGGYCC 3' (primer 1 15-71) 

2 0 (23-mer) (SEQ ID NO: 17); the degeneracy of the internal primer was reduced from 

4096-fold to 4-fold with the incorporation of 5 deoxyinosine residues (N = 
deoxyinosine; Y = C or T; R = A or G). 



The T. litoralis inorganic pyrophosphatase gene fragment was amplified 

2 5 using PCR under the following reaction conditions. A 100 ml reaction contained 

100 ng of genomic 7\ litoralis DNA, lx Taq buffer (50 mM KC1, 10 mM Tris- 
HC1, pH 9.0 @ 25°C, 0.1% Triton X-100, 2.5 mM MgCl2), 200 mM dNTP's, the 
primers 1 15-70 and 115-71, and 2.5 units of Taq DNA polymerase. The PCR 
reaction was run on a thermal cycler for forty cycles of denaturing at 95°C for 1 

3 0 minute, annealing at 35*C for 1 minute and extending at 72°C for 1 minute. A 

single 265 bp PCR product was obtained. No PCR products were obtained at 
annealing temperature of 40 W C and 45 - C. The PCR product was purified using a 
Centricon-100 (Amicon; Danvers, Massachusetts) according to manufacturer's 
recommendation. Final concentration of the purified PCR fragment was 
3 5 approximately 15 ng/ml with a total yield of 1.5 mg in a 100 ml volume. 
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D. Identification of Partial Nucleic Acid Sequence of the T. 
Litoralis Inorganic Pyrophosphatase Gene 

The PCR fragment was sequenced on a thermal cycler using the two PCR 
primers (115-70 and 115-71), from Example ILC and primer 109-130, described 
in Example II.B. All three reactions were sequenced according to standard Applied 
BioSystems Division, Perkin-Elmer Corporation (Foster City, California) Taq 
Dyedeoxy™ Terminator Cycle Sequencing Kit protocols. Sequencing reactions 
were set up as follows: 9.5 \i\ reaction pre-mix (16 |il 5X TACS Buffer, 4 \i\ 
dNTP Mix, 4 |il each of DyeDeoxy Terminators (A, G, C and T) and 2 (il 
AmpliTaq® DNA Polymerase), 7.0 \x\ purified PCR fragments and 3.2 pmol 
primer in a reaction volume of 20 pi The sequencing reactions were cycled under 
the following conditions: preheat cycler to 95*C for 5 minutes, 25 cycles of 1 
minute denaturing at 95°C, 2 minutes annealing at 35°C and 1 minute extension at 
72°C. Extension products were purified using Centri-Sep spin columns (Princeton 
Separation Sciences; Princeton, New Jersey) following the recommended spin 
column purification protocol. Samples were run on an Applied BioSystems 
Division, Perkin-Elmer Corporation (Foster City, California) 373A Automated 
DNA Sequencer. Data was analyzed using the Applied BioSystem Division, 
Perkin-Elmer Corporation (Foster City, California) Mac-Assembler Computer 
Programs (Factura™ and Auto Assembler™). 

The consensus sequence of the PCR-generated fragment plus its 
corresponding amino acid sequence can be seen in Figure 2. Comparison of the 
amino acid sequence with other known inorganic pyrophosphatases showed 
homology. Studies of evolutionary conservation among the known amino acid 
sequences for soluble inorganic pyrophosphatase have shown that only functional 
residues appear to be conserved, while other residues are more distantly related. 
See Cooperman, et aL, Trends Bioc hem. ScL (1992) 17:262-266, the disclosure of 
which is hereby incorporated by reference herein. Comparison of the partial amino 
acid sequence for the T. litoralis PPase with the alignment of sequences for S. 
cerevisiae, A. thaliana, E. coli, PS-3 and T. acidophilum in Richter and Schafer, 
Eur. J. Biochem. (1992) 209:351-355, the disclosure of which is hereby 
incorporated by reference herein, showed conservation in the same regions. See 
Figure 3. 
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E. Obtaining and Cloning the Complete T. litoralis Inorganic 
Pyrophosphatase 

PCR has identified the DNA fragment coding for the amino terminal half of 
5 the T. litoralis inorganic pyrophosphatase. In order to find a fragment large enough 

to code for the entire gene, restriction enzyme digests of T. litoralis genomic DNA 
were prepared and were used for probing with the PCR fragment. Restriction 
enzyme digests were performed in separate tubes. Each reaction contained 2 |ig of 
T. litoralis DNA in a total reaction volume of 50 \il Each enzyme (all from New 
1 0 England Biolabs, Inc.; Beverly, Massachusetts) was used in conjunction with its 

recommended reaction buffer. Each reaction contained 20 units of enzyme, after 
incubation for 60 minutes, another 20 units was added and the reaction was 
incubated further for 60 minutes. The following digests were performed: Bam HI, 
Eco RI, Hind III, Kpnl, Pst I, Sac I, Sac II, Sal I, Sma I, Sph I and Xba I. All 

1 5 digests were performed at 37°C except for Sma I which was incubated at 25°C. 

Digests were electrophoresed on an agarose gel and transferred to nylon membrane, 
Immobilon™-S (Millipore; Bedford, Massachusetts) using the method described in 
example II.B. The PCR product was labeled with biotin according to the 
recommended protocol in NEBlot™ Phototope Detection Kit (New England 

2 0 Biolabs, Inc.; Beverly, Massachusetts). The membrane was probed with the biotin- 

labeled PCR fragment and hybridization was detected with an enzyme catalyzed 
light-emitting reaction according to the recommended protocol in NEBlot™ 
Phototope Kit (New England Biolabs, Inc.; Beverly, Massachusetts) The released 
light was subsequently captured on X-ray film. The resulting banding pattern 

2 5 revealed single fragments large enough to encode the entire inorganic 

pyrophosphatase gene (approximately 1,5 -3 kb). Digests in which the biotin- 
labeled PCR fragment hybridized to fragments large enough to contain the entire 
PPase gene were Bam HI (approximately 1600 bp), Eco RI (approximately 1800 
bp), Hind III (approximately 870, 1500 bp), Pst I (approximately 3500 bp) and 

3 0 Sac I (approximately 1300 bp). All other digests revealed fragments greater than 

4000 bp. 

Bam HI, Eco RI and Xba I 7\ litoralis secondary libraries already exist at 
New England Biolabs, Inc. (Beverly, Massachusetts) for screening, as described in 
3 5 U.S. Patent No. 5,210,036 and 5,322,785, the disclosures of which are hereby 

incorporated by reference herein. Alternatively, new libraries can be constructed 
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using standard techniques as described in Sambrook, Fritsch and Maniatis, supra, 
the disclosure of which is hereby incorporated by reference herein. 

An alternative approach is to use inverse PCR on a restriction fragment of 
T. litoralis genomic DNA encoding the entire gene for the inorganic 
pyrophosphatase. This approach would allow the rapid amplification of the DNA 
sequence that flanks the 265 bp region of known sequence. See Ochman, H., et 
al., Genetics (1988), 12:621-623, the disclosure of which is hereby incorporated 
by reference herein. 

Mm HI Library 

A Bam HI genomic library was constructed using X Dashll. X Dashll is a 
Bam HI substitution vector that can be used to clone 10-20 kb Bam HI DNA 
fragments. 25-75 nanograms of T. litoralis genomic DNA digested with Bam HI 
was ligated to 0.5 |ig Bam HI digested, calf intestine phosphatase treated X Dashll 
DNA in 5.0 \il of strand ligation buffer including 0.5 \i\ T4 DNA ligase (New 
England Biolabs, Inc. No. 202). Three [i\ of the ligation reaction were packaged 
using Gigapack Plus (Stratagene; La Jolla, California) according to the 
manufacturer's instructions. After incubation at room temperature for two hours, 
the packaged phage were diluted in 500 |il of SM buffer (SM buffer = 100 mM 
NaCl, 8 mM MgS0 4 , 50 mM Tris pH 7.5, 0.01% gelatin) plus three drops 
chloroform. The packaged Bam HI library was called sample V96. E. coli strain 
ER1458 was used for phage infection. 

Mixed Part ial XvtM Expression Library 

A mixed partial genomic library was constructed using A.gtl L T. litoralis 
DNA was partially digested with 7 different restriction enzymes (Alu I, Dra I, 
HaeUh Hinc II, Pvu II, Rsa I and Ssp I) in the following manner: 4.0 |ig of T. 
litoralis DNA was digested at 37°C with 5.0 units of each enzyme in separate 40 |il 
reactions containing a 50 mM NaCl buffer (10 mM Tris pH 7.5, 50 mM NaCl, 10 
mM MgCl2 and 5 mM BME). Three (il of 100 mM EDTA was added to 15 |il 
samples at 30, 45 and 60 minutes. 0.2 |ig of each digest was electrophoresed on an 
agarose gel to monitor the extent of digestion. Approximately 31 fxg of T. litoralis 
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DNA partials were pooled to form the "mixed partial pool", all reactions were used 
except the Rsa I (45 minutes) and Ssp I (30 minutes) digests; the pool was then 
heated at 65°C for 15 minutes. 

5 40 jxl of the mixed partial pool were ligated to various linkers using 

standard ligation buffer (ligation buffer = 66 mM Tris pH 7.5, 1 mM ATP, 1 mM 
spermidine, 10 mM MgCl 2 , 15 mM DTT, and 2 mg/ml gelatin), 2 |il T4 
polynucleotide kinase (NEB No. 201) and 10 |il T4 DNA ligase (NEB No. 202). 
The ligation was performed at 16 W C overnight. This ligation reaction was processed 
10 by phenol/chloroform extraction, ethanol precipitation and run over a Sepharose CL 

4B column. 0.5 of fraction 1 1 from the Sepharose CL 4B column was ligated to 
0.28 mg of EcoRI cut, bacterial alkaline phosphatase treated Xgtl 1 DNA in a 5.0 |jJ 
reaction using the standard ligation buffer, as described above, and 1.0 |il T4 DNA 
ligase (NEB No. 202). The ligation was performed at 16°C overnight. 4 jal of this 

1 5 ligation reaction were packaged using Gigapack Gold (Stratagene; La Jolla, 

California) according to the manufacturers instructions. After incubation at room 
temperature for two hours, the packaged phage were diluted in 500 |il of SM (SM 
= 100 mM NaCl, 8 mM MgS0 4 , 50 mM Tris pH 7.5, 0.01% gelatin) plus three 
drops of chloroform. The packaged mixed partial library was called sample V6-5 

2 0 and consisted of 1 X 10 4 individual phage. £. coli strain ER1458 was used for 

phage infection. 

Serening Of the Mixed Partial and Bam HI Libraries and Suhelonin f 

gf Inserts 

25 

The Bam HI X Dashll and mixed partial Xgtl 1 phage libraries were 
screened according to standard protocols using the PCR product from Example 
ILB. radiolabeled with ot 32 P. 20 phage (Dl through D10 and Gl through G10) 
which reacted with the probe were picked and 8 were plaque purified. 

3 0 

The 8 PCR-probe positive phage were used to lysogenize E. coli 1458. 
Plate lysate was prepared and phage DNA was purified from liquid culture using 
Qiagen-tip 20 columns (Qiagen; Chattsworth, California) according to the 
manufacturer's recommended protocol for lambda DNA purification. DNA from 
3 5 each ADash II phage (Dl-A, D2-A, D3-A) were digested with Bam HI, while DNA 

from each Xgtl 1 phage (Gl-A, G2-A, G3-A, G4-A) were digested separately with 
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Eco RI and Bsi WI. Double digesting the Xgtl 1 with Kpn VSac I to liberate a DNA 
fragment containing inserted DNA at the Eco RI restriction site was not possible 
since the PPase gene contains internal Sac I sites, as discovered in Example ILD.. 
20 units of each enzyme (all from New England Biolabs, Inc.; Beverly, 
5 Massachusetts) was used in conjunction with its recommended reaction buffer in a 

volume of 50 \il and incubated at 37°C for six hours. Digests were electrophoresed 
on a 1% agarose gel and transferred to nylon membrane, Immobilon™-S 
(Millipore; Bedford, Massachusetts) using the method described in Example II.B.. 
The PCR product of Example ILC. was labeled with biotin according to the 
1 0 recommended protocol in NEB lot™ Phototope Detection Kit (New England 

Biolabs, Inc.; Beverly, Massachusetts). The membrane was probed with the biotin- 
labeled PCR fragment and hybridization was detected with an enzyme catalyzed 
light-emitting reaction according to the recommended protocol in NEB lot™ 
Phototope Detection Kit (New England Biolabs, Inc.; Beverly, Massachusetts). 

1 5 The released light was subsequently captured on X-ray film. The resulting banding 

pattern revealed single fragments that agreed with the Southern blot analysis of 
genomic T. litoralis DNA of Example II.E. and were large enough to encode the 
entire inorganic pyrophosphatase gene (approximately 1.5 - 3.0 kb). Fragments 
that were large enough to contain the entire PPase gene were Dl A/Bam HI 

2 0 (approximately 1600 bp), D3 A/Bam HI (approximately 1600 bp), G 1 A/ Eco RI 

(approximately 1500 bp), G4A/Eco RI (approximately 3000 bp) and Gl A/Bsi WI 
(approximately 2000 bp). 

The above fragments were reobtained by digesting four times the amount of 

2 5 phage DNA as described above in a reaction volume of 100 yi\ with 10 pi of each 

restriction enzyme (New England Biolabs, Inc.; Beverly, Massachusetts). The 
complete digests were run on a 1% agarose gel and excised from the agarose using 
GENE-CLEAN® II, according to the manufacturer's recommended protocol 
(BiolOl; Vista , California). A 2 (il sample of each of the 5 purified phage DNA 

3 0 inserts were run on a 1% agarose gel to check recovery. 

Each insert (0.08 pmoles) was ligated to calf intestinal alkaline phosphatase 
(CIAP) treated pUC19 (0.016 pmoles) digested with Bam HI, Eco RI or Acc 651 to 
give compatible ends in a 20 (J.1 volume containing standard ligation buffer, as 
3 5 described above, and 5 units T4 DNA ligase (New England Biolabs, Inc.; Beverly, 
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Massachusetts, Product No. 202). The ligation was performed at 16*C overnight. 
ER2426 competent cells (Sambrook, Fritsch and Maniatis, supra, the disclosure of 
which is hereby incorporated by reference herein) were transformed with the 
ligation mixture. Of 28 recombinants examined, three looked promising; clones 
F10, F14 and F22 contained inserts large enough to contain the entire T. litoralis 
inorganic pyrophosphatase gene. 

Using Inverse PCR to Identify the Entire Inorganic Pyrophosphatase 
Gene from 7\ litoralis Genomic DNA 



The first step in using IPCR to amplify the entire T. litoralis PPase gene 
was to identify restriction enzymes that would cleave on either side of the T. 
litoralis inorganic pyrophosphatase gene. Fragments less than 2000 bp in size were 
desirable for best possible amplification. Southern blot analysis, as described in 

1 5 Example ILE., indicated four possible candidate restriction enzymes with six-base 

cleavage sites that would produce DNA fragments in the desired size range: BarriHI 
(approximately 1600 bp), Eco RI (approximately 1800 bp), Hind III 
(approximately 870, 1500 bp) and Sac I (approximately 1300 bp). However, 
restriction enzymes with four-base cleavage sites are generally preferred for IPCR 

2 0 because they are more likely to cleave at an acceptable point within the flanking 

target sequence. Bfa I, Hae III, Hha 1 and Sau 3AI were selected because they do 
not cleave within the known 265 bp fragment. See Example MX. Each restriction 
digest contained 10 |ig of T. litoralis genomic DNA in a total reaction volume of 
400 |lL Each restriction enzyme (all from New England Biolabs, Inc.; Beverly, 

2 5 Massachusetts) was used in its recommended reaction buffer, to which 160 - 500 

units of enzyme was added as follows: 400 units Bam HI, 400 units Eco RI, 400 
units Hind III, 400 units Sac I, 200 units Bfa I, 500 units Hae III, 400 units Hha I 
and 160 units Sau 3AI. The mixtures were incubated at 37 °C for 6 hours. The 
digests were phenol/chloroform extracted and ethanol precipitated. Each DNA pellet 

3 0 was resuspended in 400 ^1 of ligase buffer (50 mM Tris-HCl, pH 7.8, 10 mM 

MgCl 2 , 10 mM DTT, 1 mM ATP, 25 |ig/ml BSA), then 5 ^1 T4 DNA ligase (NEB 
No. 202C) was added. The ligation was allowed to proceed at 16°C overnight with 
the hope of forming monomelic circles. The ligase was heat killed for 20 minutes at 
65 # C. The ligation reactions were phenol/chloroform extracted and ethanol 
3 5 precipitated; each pellet was resuspended in 50 |il IX TE (IX TE = 10 mM Tris, 1 

mM EDTA, pH 8.0). 
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The primers used for IPCR were derived from the 265 bp PCR fragment of 
the N-terminal half of the PPase gene. The sequence of the clockwise 24-mer is as 
follows: 5* GGTACGATGATGACGACCCGTTTG 3' ( primer 120-96) (SEQ ID 
NO: 18). The sequence of the counterclockwise primer is: 5' GGATGATTCCATA 
GTCGACCGGG 3' (primer 120-95) (SEQ ID NO: 19). Each IPCR reaction was 
set up as follows: 10 |il of circularized template from the ligation reactions, 200 |iM 
dNTPs, 100 ng primers, 2.5 units AmpliTaq® DNA polymerase (Applied 
BioSystems Division, Perkin-Elmer Corporation; Foster City, California) in PCR 
buffer ( 50 mM KC1, 10 mM Tris-HCl, pH 9.0 @ 25°C, 0.1% Triton X-100, 2.5 
mM MgCl2). Thirty cycles of denaturing at 95 B C for one minute, annealing at 50°C 
for 2 minutes, and extending at 72*C for 2 minutes were run in a thermal cycler. 
IPCR products were obtained from the following circularized templates: Bam HI 
(approximately 1600 bp), Hind III (approximately 1500 bp), Sac I (approximately 
300 bp), Bfa I (approximately 1800 bp) and Sau 3AI (approximately 975 bp). The 
size of the IPCR products from the Bam HI and Hind III ligated circles agreed with 
the results obtained from the Southern blot analysis; the IPCR product from the Sac 
I ligated circle indicated the possibility of two internal Sac I sites due to its 300 bp 
size. No IPCR products were obtained at annealing temperature of 55°C and 60 W C 
Attempts to obtain IPCR products from Eco RI, Hae III and Hha I digests were 
unsuccessful. Each IPCR product was purified using a Centricon-100 (Amicon; 
Danvers, Massachusetts) according to manufacturer's recommendations. See 
Figure 4 for a flow chart illustrating the IPCR process used to obtain the entire 
inorganic pyrophosphatase gene from T. litoralis genomic DNA. 

All the IPCR fragments were sequenced on a thermal cycler using the two 
IPCR primers (120-95 and 120-96), described above. All reactions were sequenced 
according to standard Applied BioSystems Division, Perkin-Elmer Corporation 
(Foster City, California) Taq DyeDeoxy™ Teminator Cycle Sequencing Kit 
protocols. Sequencing reaction were set up as follows: 9.5 |il reaction pre-mix (16 
pj 5X TACS Buffer, 4 \A dNTP Mix, 4 |il each of DyeDeoxy™ Terminators (A, 
G, C and T) and 2 |il AmpliTaq® DNA Polymerase), purified IPCR fragments (50 
- 200 ng) and 3.2 pmol primer in a reaction volume of 20 pi. The sequencing 
reactions were cycled under the following conditions: preheat cycler to 95°C for 5 
minutes, 25 cycles of 1 minute denaturing at 95°C, 2 minutes annealing at 35°C and 
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1 minute extension at 72°C. Extension products were purified using Centri-Sep 
(Princeton Separation Sciences; Princeton, New Jersey) column purification 
protocoL Samples were run on an Applied BioSystems Division, Perkin-Elmer 
Corporation (Foster City, California) Automated 373A DNA Sequencer. Data was 
5 analyzed using the Applied BioSystem Division, Perkin-Elmer Corporation (Foster 

City, California) Mac-Assembler Programs (Factura™ and AutoAssembler™). 

Four additional primers were designed from the DNA sequencing data 
obtained above. Two primers were designed from the Hind III/IPCR product: 5' G 
1 0 TTGAGGGCTGGTCAAAACGC 3' (primer 1078) (21-mer) (SEQ ID NO:20), 

and 5' GCGTTTTGACCAGCCCTCAAC 3' (primer 1079) (21-mer) (SEQ ID 
NO:21); and two primers were designed from the Bam HI/IPCR product: 5' GAG 
ATTGTGCACTTCTTCCAGAGA 3' (primer 1080) (24-mer) (SEQ ID NO:22), 5' 
TCTCTGGAAGAAGTGCACAATCTC 3' (primer 1090) (24-mer) (SEQ ID 

1 5 NO:23). The IPCR products from the Bam HI, Hind III, Bfa I and Sau 3AI ligated 

circles were sequenced as described above with primers 1078, 1079, 1080 and 
1090. The data was analyzed as described above. 

The consensus sequence of the IPCR- generated fragments revealed the 

2 0 entire T. litoralis inorganic pyrophosphatase gene plus flanking DNA sequences 

both up- and down- stream from the PPase gene. The corresponding amino acid 
sequence was compared to the amino acid sequences of other known 
pyrophosphatases (5. cerevisiae, A, thaliana y E. colu PS-3 and T. acidophilum). It 
was confirmed that the entire PPase gene had been synthesized using IPCR, 

2 5 however, the gene was now divided into two pieces, in opposite orientations, on 

opposite ends of the IPCR-generated fragments. 

EXAMPLE III 

3 0 A . Over-Expression of the Thermococcus Litoralis Inorganic 

Pyrophosphatase 



3 5 



The gene fragments obtained through methods in Example II may be used 
in a number of approaches, or combinations thereof, to obtain maximum 
expression of the cloned 7\ litoralis inorganic pyrophosphatase. 
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One such approach comprises separating the T. litoralis inorganic 
pyrophosphatase gene from its endogenous control elements and then operably 
linking the polymerase gene to a very tightly controlled promoter such as a T7 
expression vector (Rosenberg, et al., Gene (1987) 56:125-135, the disclosure of 
which is hereby incorporated by reference herein). Insertion of the strong promoter 
may be accomplished by identifying convenient restriction targets near both ends of 
the 7\ litoralis inorganic pyrophosphatase gene and compatible restriction targets on 
the vector near the promoter, or generating restriction targets using site directed 
mutagenesis (Kunkel, Methods in Enzymology (1987) 154:367, the disclosure of 
which is hereby incorporated by reference herein) and transferring the T. litoralis 
inorganic pyrophosphatase gene into the vector in such an orientation as to be under 
transcriptional and translational control of the strong promoter. 

7\ litoralis inorganic pyrophosphatase may also be overexpressed by 
utilizing a strong ribosome binding site placed upstream of the T. litoralis inorganic 
pyrophosphatase gene to increase expression of the gene. See, Shine and Dalgarno, 
Proc. Natl Acad, Sci. USA (1974) 71:1342-1346, the disclosure of which is 
hereby incorporated by reference herein. 

Another approach for increasing expression of the 7\ litoralis inorganic 
pyrophosphatase comprises altering the DNA sequence of the gene by site directed 
mutagenesis or resynthesis to contain initiation codons that are more efficiently 
utilized than E. coli. 

A further approach for increasing the expression of T. litoralis inorganic 
pyrophosphatase gene comprises designing oligonucleotide primers for 
hybridization to both sides of the T. litoralis inorganic pyrophosphatase gene; 
polymerase chain reaction can then be used to amplify the inorganic 
pyrophosphatase gene with engineered cloning sites on each end. The amplified 
fragment is inserted into one of the above-mentioned expression vectors such as 
pGW7 (ATCC No. 40166) or pET3A from William Studier, (Brookhaven, Nat. 
Lab., Upton, New York). Both vectors contain a strong promoter and a ribosome 
binding site. Even strongly regulated promoters may be subject to read through 
expression, to limit this possibility transcription terminator sequences can be placed 
just upstream of the controlled promoter. Also, an inducible antisense promoter can 
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be placed 3' to the end of the inorganic pyrophosphatase gene to reduce read 
through transcription. 

Finally, T. litoralis inorganic pyrophosphatase may be more stable in 
5 eukaryotic systems like yeast or Baculovirus. 

The T. litoralis inorganic pyrophosphatase may be produced from clones 
carrying the T. litoralis inorganic pyrophosphatase gene by propagation in a 
fermentor in a rich medium containing appropriate antibiotics. Cells are thereafter 
1 0 harvested by centrifugation and disrupted by sonication to produce a crude cell 

extract containing the 7. litoralis inorganic pyrophosphatase activity. 

The crude extract containing the T. litoralis inorganic pyrophosphatase 
activity can be purified by the methods described in Example I, or by standard 

1 5 product purification techniques such as affinity-chromatography, or ion-exchange 

chromatography. 

B . Using Primer Extension Reactions to Amplify the Entire T. 
litoralis Inorganic Pyrophosphatase Gene for Cloning and 

2 0 Over-Expression 

The consensus sequence from the IPCR-generated fragments was used to 
design new primers for the generation of primer extension products for cloning into 
an expression vector such as pMAL-cII, pAII17, pUC18 (all from New England 

2 5 Biolabs, Inc.; Beverly, Massachusetts). The following primers were designed at 

the N-terminus: (1) 5' P -ATGAATCCATTCCACGATTTAGAGCCT 3' (primer 
#1092) (SEQ ID NO:24), a 27-mer phosphorolyated at the 5-end for blunt-end 
cloning into such vectors as pMAL-cII; (2) 5* GGGAATTC CATATG AATCCAT 
TCCACGATTTAGAGCCT 3' (primer #1094) (SEQ ID NO:25), a 38-mer for 

3 0 cloning into an Nde I site in such vectors as pAII17 and (3) 5' CCGGAATTCATG 

AATCCATTCCACGATTTAGAGCCT 3' (primer #1093) (SEQ ID NO:26), a 36- 
mer for cloning into an Eco RI site in such vectors as pUC18. The following 
primer was designed at the C-terminus: 5' CGC GG ATCC TC A CTTCTTG A ATTT 
CTCCTTGTAAAG 3' (primer #1095) (SEQ ID NO:27), a 33-mer for cloning into 
3 5 a Bam HI site. All four primers were separated and excised from a Novex 20% 

polyacrylamide TBE gel; the oligonucleotides were then purified for primer 
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extension using SEP-PAK® CI 8 cartidges (Waters Corporation; Milford, 
Massachusetts). 

The entire T. litoralis inorganic pyrophosphatase gene was synthesized for 
cloning using three different polymerase chain reactions with the following 
combination of primers: (1) 1092 + 1095, (2) 1093 + 1095 and (3) 1094 + 1095. A 
high fidelity thermophilic DNA polymerase with 3'-> 5' proofreading exonuclease 
activity (VentR® DNA polymerase (New England Biolabs, Inc.; Beverly, 
Massachusetts)) was used. In all three cases, the following reaction conditions 
were used. A 100 \xl reaction contained 100 ng of genomic T. litoralis DNA, 1 X 
ThemPol Buffer (10 mM KC1, 20 mM Tris-HCl (pH 8.8 @ 25°C), 10 mM 
(NH4)2S04, 2 mM MgS04, 0.1% Triton X-100), additional 2 mM MgS04, 400 
pM dNTP's, the primers (0.4 |iM) and 1 unit of Vent R ® DNA polymerase. The 
primer extension reactions were run on a thermal cycler with a five minute pre- 
incubation step at 95°C and 20 cycles of denaturing at 95°C for one minute, 
annealing at 55°C for one minute and extending at 72°C for one minute. The 
primer extension reactions were purified using Centricon-lOOs (Amicon; Danvers, 
Massachusetts) according to manufacturer's recommendation. Final concentration 
of the purifed primer extension reactions were approximately 5 ng/|al in a 100 [i\ 
volume. 

One reaction (1092 + 1095) was sequenced on a thermal cycler using the 
following primers: 1092 and 1095, described above, and primers 1080, 1090, 120- 
95 and 120-96, described in Example II.E.. All six reactions were sequenced 
according to standard Applied BioSystems Division, Perkin-Elmer Corporation 
(Foster City, California) Taq Dyedeoxy™ Terminator Cycle Sequencing Kit 
protocols. Sequencing reactions were set up and data analyzed as previously 
described in Example II.D.. The concensus sequence revealed the expected 
complete DNA sequence of the T. litoralis inorganic pyrophosphatase, a 531 bp 
gene coding for a 176 amino acid (19,666 Da) subunit. See Figure 5. 

Restriction digests were performed on all three primer extension products. 
Each reaction contained 200 ng of the primer extension product in a total reaction 
volume of 50 pi. Each restriction enzyme (all from New England Biolabs Inc.; 
Beverly, Massachusetts) was used in conjuction with its recommended reaction 
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buffer. Each reaction contained 10 units of each enzyme, and was incubated for 60 
minutes at 37°C. The following digests were performed on the primer extension 
reactions: 1092 + 1095, Bam HI digest; 1093 + 1095, Eco RI/Bam HI double 
digest; and 1094 + 1095, Nde VBam HI double digest. The restriction enzymes 
5 were then heat-killed at 65°C for 20 minutes; the reactions were phenol/chloroform 

extracted and ethanol precipitated. The resultant pellets were resuspended in 10 |jJ 
of 1 x TE (1 x TE = 10 mM Tris, 1 mM EDTA, pH 8.0) for insertion into a suitable 
expression vector. 

10 C. Cloning the Primer Extension Product into the Vector pAII17 

One preferred mode of expressing the PPase gene is under the control of the 
T7 promotor system. Therefore, the appropriate primer extension product, as 
described in Example III.B., was cloned into the Nde I to Bam HI site on the 

1 5 vector pAII17, a T7 vector based on pETl lc. See Kong, et al., (1993) Journal of 

Biological Chemistry, 268: 1965-1975. The primer extension reaction was ligated to 
pAII17 cut with Nde VBam HI as follows: 240 ng of the primer extension 
generated fragment was mixed with 48 to 72 ng of Nde I /Bam Hl-cleaved and 
dephosphorolyated pAII17 (phenol/chloroformed, ethanol precipitated). The 

2 0 ligation reaction was performed in a volume of 40 p.1 in IX ligation buffer (50 mM 

Tris, pH 7.5, 10 mM MgCl 2 , 10 mM DTT, 0.5 mM ATP). T4 DNA ligase (1 \xl) 
was added and the mixture was allowed to incubate at 16°C for 16 hours. An 
aliquot of 20 jil was used to transform both E. coli strains ER 2497/pLysS and 
ER2497 respectively. After a 2-minute heat shock at 42°C, the cells were diluted 

2 5 with 100 |il of Luria-broth (L-broth) and grown for 20 minutes at 37°C The 

transformed cell cultures were plated onto LB agar plates containing 100 |iig/ml 
ampicillin. After overnight incubation at 37°C, the plates contained 16 colonies, 2 
in the E. coli strain ER2497 and 14 in ER2497/pLysS. 

3 0 To analyze the individual colonies, all 16 colonies from the above 

transformation were picked and plated on LB agar plates containing 100 jag/ml 
ampicillin. The 16 colonies were grown up in 10 ml cultures (L broth + 100 (J.g/ml 
ampicillin) and the plasmids that they carried were prepared by the following 
miniprep purification procedure. Each culture was processed as follows: 3 ml of the 
3 5 overnight culture was pelleted at 12,000xg for 1-2 minutes in a microcentrifuge. 
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The supernatant was poured off and the cell pellet was resuspended in 200 |il of 50 
mM Tris-HCl pH 7.5, 10 mM EDTA, 100 ^ig/ml RNase A. To lyse the cells, 200 
1^1 of 0.2 M NaOH, 1% SDS was added and mixed by inversion. After 5 minutes, 
200 pi of 3M sodium acetate, pH 4.8, was added and mixed by inversion for an 
additional 5 minutes. The precipitate that formed was spun down at 12,000xg at 
4°C for 5 minutes. The supernatant was removed and added to 1 ml Celite (Fluka 
Chemika; Ronkonkoma; New York) slurry (1.5 g Celite suspended in a solution of 
66.84 g guanidine hydrochloride, 33.33 ml 3 M sodium acetate, pH 4 .8, pH 
adjusted to 5.5 using 10 M NaOH, final volume to 100 ml). The resin/DNA mix 
was packed into a minicolumn (Promega; Madison, Wisconsin); the resin was then 
dried by spinning at 12,000xg for 20 seconds in a microcentrifuge. The DNA was 
eluted by applying 50 |il of preheated (70°C - 80°C) 10 mM Tris, 1 mM EDTA, 
pH 8.0, waiting one minute and then spinning at 12,000xg for 20 more seconds. 
The miniprepped DNA (5 was subjected to digestion with Nde VBam HI. One 
clone (T7-5) appeared to have the appropriate size insert for the PPase gene. 

To check for PPase expression and confirm the presence of the correct 
insert, 5 ml cultures (LB/Amp) of 4 colonies were grown, one being #T7-5. A 1 ml 
sample was removed when the cultures reached a density of 2 x 10 8 cells/ml (A<500 
of -0.5); each sample was microcentrifuged for 2 minutes, supernatant discarded 
and cells resuspended in 50 protein gel SDS-PAGE sample buffer (New 
England Biolabs, Inc.; Beverly, Massachusetts, Product No. 7707). IPTG was 
added to the remaining cultures to 0.3 mM and incubated at 37°C with good 
aeration for 2 hours. A 0.5 |il sample was removed, microcentrifuged for 2 
minutes, supernatant discarded and cells resuspended in 100 pal of SDS-PAGE 
sample buffer. The samples were heated in a boiling water bath for 5 minutes and 
then electrophoresed (15 on a 10% SDS-PAGE gel along with a set of protein 
molecular weight standards: MBP-(3-galactosidase, 175 kDa; MBP-paramyosin, 83 
kDa; glutamic dehydrogenase, 62 kDa; aldolase, 47.5 kDa; triosephosphate 
isomerase, 32.5 kDa; (3-lactoglublin A, 25 kDa; lysozyme, 16.5 kDa; and 
aprotinin, 6.5 kDa. After staining the gel with ISS Pro-Blue (Integrated Separation 
Systems; Natick, Massachusetts), a large protein band was visible in the induced 
sample from #T7-5; the size of the induced protein corresponded to the expected 
molecular weight of 20 - 21 kDa for the T. litoralis inorganic pyrophosphatase. 
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The pAII17/PPase clone (T7-5) was subjected to DNA sequencing to 
confirm the correct construct. Plasmid DNA from T7-5 was isolated using standard 
procedures for cesium chloride/ethidium bromide purification. See Sambrook, 
Fritsch, and Maniatis, supra. T7-5 DNA was sequenced on a thermal cycler using 
5 the following primers: T7 universal primer #1248 (New England Biolabs, Inc.; 

Beverly, Massachusetts); primers 1092 and 1095, described in Example III.B.; and 
primers 1080, 1090, 120-95 and 120-96, described in Example H.E.. All seven 
reactions were sequenced according to standard Applied BioSystems Division, 
Perkin-Elmer Corporation (Foster City, California) Taq DyeDeoxy™ Terminator 
1 0 Cycle Sequencing Kit protocols. Sequencing reactions were set up and analyzed as 

described in Example II.D.. The DNA sequences revealed that the pAII17/PPase 
clone T7-5 had the correct T. litoralis inorganic pyrophosphatase promoter gene 
construct. See Figure 6 for a diagram of the pAII17/PPase promoter construct. 

1 5 Although the above-outlined steps represent the preferred mode for 

practicing the present invention, it will be apparent to those skilled in the art that the 
above-described approach can vary in accordance with techniques known in the art. 
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D. Cloning the Primer Extension Product into the Vector pUC18 



Another preferred mode of expressing the PPase gene is in the pUC vector 
series, such as pUC18, a small, high copy number E. coli plasmid cloning vector. 
The appropriate primer extension product, as described in Example III.B., was 
cloned into the Eco RI to Bam HI site. The primer extension reaction was ligated to 

2 5 pUCl 8 cut with Eco RI /Bam HI as follows: 1 60 ng - 240 ng of the primer 

extension generated fragment was mixed with 50 ng of Eco RI I Bam Hl-cleaved 
and dephosphorolyated pUC18 (phenol/chloroformed, ethanol precipitated). Two 
ligation reactions was performed containing either 2 ]i\ or 3 |il of primer extension 
generated fragment, each in a volume of 20 jil in IX ligation buffer (50 mM Tris, 

3 0 pH 7.5, 10 mM MgC^, 10 mM DTT, 0.5 mM ATP). T4 DNA ligase (New 

England Biolabs, Inc.; Beverly, Massachusetts) (0.5 |il) was added and the 
mixtures were allowed to incubate at 16°C for 16 hours. CaCl competent cells (25 
|Lil) of E. coli strain ER2426 were transformed with the ligation mixtures 
(Sambrook, Fritsch and Maniatis, supra, the disclosure of which is hereby 
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incorporated by reference herein). The transformed cell cultures were plated onto 
LB agar plates containing 100 M-g/ml ampicillin and incubated overnight at 37°C 

To analyze the individual colonies, 44 colonies from the above 
transformations were picked and plated on LB agar plates containing ampicillin. 36 
colonies were grown up in 10 ml cultures of LB containing ampicillin (100 p.g/ml) 
and kanamycin (50 ng/ml). The plasmids that they carried were prepared using a 
PEG modified mini alkaline-lysis/PEG precipitation procedure. Each culture was 
processed as follows: 4.5 ml of the overnight culture was pelleted at 12,000xg for 
1-2 minutes in a microcentrifuge. The supernatant was poured off and the cell pellet 
was resuspended in 200 \i\ of GTE buffer (50 mM glucose, 25 mM Tris-HCl pH 
8.0, 10 mM EDTA). To lyse the cells, 300 |il of freshly prepared 0.2 N NaOH, 
1% SDS was added and mixed by inversion. After 5 minutes, 300 (il of 3M 
sodium acetate, pH 4.8, was added and mixed by inversion for an additional 5 
minutes. The precipitate that formed was spun down at 12,000xg at 25°C for 10 
minutes. The supernatant was removed, RNase A was added to a final 
concentration of 20 M-g/ml and incubated at 37°C for 20 minutes. After RNase A 
treatment, the supernatant was extracted twice with 400 |il of chloroform and 
ethanol precipitated. The pellet was resuspended in 32 pJ CIH2O, and precipitated 
with 8.0 |il 4 M NaCl and 40 |il of autoclaved 1 3% PEGgooo on ice for 20 minutes. 
The plasmid DNA was pelleted by centrifugation for 15 minutes at 4°C and rinsed 
with 70% ethanol. The final pellet was resuspended in 20 [i\ dH20. The resultant 
DNA (2 jlxI) was subjected to digestion with Eco RI I Bam HI. Transformants that 
appear to have the proper construct for the T. litoralis PPase gene are pUC18/PPase 
clones 2, 4, 5, 6, 9, 10, 13, 19, 21, 28, 31 and 32. 

To confirm the presence of the correct insert, plasmid DNA from 
pUC18/PPase clones 2, 4, 5 and 6 were sequenced on a thermal cycler using the 
following primers: ml3/pUC sequencing primer #1224 and M13/pUC reverse 
sequencing primer #1233 (both from New England Biolabs, Inc.; Beverly, 
Massachusetts). All eight reactions were sequenced according to standard Applied 
BioSystems Division, Perkin-Elmer Corporation (Foster City, California) Taq 
DyeDeoxy™ Terminator Cycle Sequencing Kit protocols. Sequencing reactions 
were set up and analyzed as described in Example II.D. The DNA sequences 
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revealed that all four pUC18/PPase clones (2, 4, 5, 6) have the correct PPase gene 
construct. See Figure 7 for a diagram of the pUC18/PPase construct. 

EXAMPLE IV 

5 

Purification of Recombinant T. Utoralis Inorganic Pyrophosphatase 

One preferred method for isolation and purification of recombinant T. 
Utoralis inorganic pyrophosphatase is accomplished using the multi-step process as 
1 0 follows: 



E. coli strain ER 2497/pLysS containing the pAII17/PPase construct was 
grown up in 1 liter of Rich broth containing 100 pig/ml ampicillin at 37°C in a 
shaking incubator. The cells were grown to a density of 2 x 10 8 cells/ml (A600 ~ 

1 5 0.5), (exponential growth phase) and induced with a final concentration of 0.3 mM 

IPTG for an additional 2 hours. The cells were harvested by centrifugation at 4000 
xg for 20 minutes, the supernatant was discarded and the cells were resuspended in 
50 ml of buffer A (20 mM Tris, pH 7.5, 10 mM B-mercaptoethanol, and 0.1 mM 
EDTA) containing 50 mM NaCl. The resuspended cells (3 g) were then frozen 

2 0 overnight at -20°C 



The 3 g of cells obtained as described above, were sonicated at 4°C. The 
lysate was centrifuged at 14,000xg for 60 minutes at 4°C. The supernatant was heat 
treated for 15 minutes at 85 °C to denature the less thermostable E. coli host 

2 5 proteins, including the E. coli inorganic pyrophosphatase. After the heat treatment, 

the particulate material was removed by centrifugation at 5000 xg for 10 minutes. 
The remaining supernatant contained approximately 40 mg of soluble protein, or 
approximately 20.5% of the protein contained in the crude extract (196 mg). 

3 0 The remaining supernatant was then filtered using a Nalgene® (Rochester, 

New York) disposable filter (0.2 jam) before applying it to a 5 ml DEAE Sepharose 
FF column (Pharmacia; Piscataway, New Jersey). The column was then washed 
with 25 ml of buffer A containing 50 mM NaCl. The column was eluted with a 50 
ml gradient from 50 mM to 1 M NaCl in buffer A. Fractions were tested for 
3 5 inorganic pyrophosphatase activity using the thin layer chromatography assay, as 

described previously. The active fractions, comprising a 21 ml volume containing 
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40 mg protein, was pooled and is ready for dialysis into a storage buffer such as 
0.1 M KC1, 20 mM Tris (pH 8.0), 0.1 mM EDTA, 1 mM dithiothreitol and 50% 
glycerol at -20°C. 

The T. litoralis inorganic pyrophosphatase obtained above was analyzed by 
SDS-polyacrylamide gel electrophoresis (10 - 20% SDS-PAGE) and subsequent 
staining of the proteins using a colloidal stain (ISS ProBlue; Integrated Separation 
Systems; Natick, Massachusetts). It was determined that the peak from the DEAE 
column was approximately 95% pure with one major band at 20 - 21 kDa, the same 
apparent molecular weight as the native 7. litoralis inorganic pyrophosphatase. See 
Figure 8. Moreover, the PPase obtained above had a specific activity of 1,750,000 
NEB units/mg as determined by the thin layer chromatography assay in which one 
NEB unit is defined as the amount of enzyme that will convert 25 nmoles inorganic 
pyrophosphate at 75°C in one hour in CircumVent™ buffer. Using the modifed 
Taussky and Schorr (Taussky and Schoor, supra, the disclosure of which is hereby 
incorporated by reference herein) assay, as previously described, a specific activity 
of 16,000 - 19,000 units/mg was determined in which one unit is defined as the 
amount of enzyme that will release 1 .0 jamole orthophosphate per minute at 72°C in 
a pH 9.0 Tris buffer. 

The 7\ litoralis inorganic pyrophosphatase expressed from pAII17 and 
obtained through the above purification procedure was found to produce 
approximately 1200-fold the amount of enzyme (mg) per gram of cells when 
compared to the expression of the same enzyme when purified from Thermococcus 
litoralis strain NS-C (DSM NO. 5473) as in Example I. 

Another preferred method for isolation and purification of recombinant 7\ 
litoralis inorganic pyrophosphatase is accomplished using E. coli strain ER2426 or 
similar £. coli strain containing the pUC18/PPase construct and by using a multi- 
step process as follows: 

E. coli strain ER2426 containing the pUC18/PPase construct was grown 
up in 1 liter of LB containing 100 jag/ml ampicillin and 50 ^il/ml kanamycin at 37°C 
in a shaking incubator. The cells were grown overnight to saturation (no IPTG 
induction). The cells were harvested by centrifugation at 4000 xg for 20 minutes, 
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the supernatant was discarded and the cells were resuspended in 50 ml of 50 mM 
NaCl, 20 mM Tris pH 7 .5, 10 mM 6-mercaptoethanol, and 0.1 mM EDTA. The 
resuspended cells (5.47 g) were then frozen at -20 P C 

5 The 5.47 g of cells obtained as described above, were thawed and sonicated 

at 4'C. The lysate was centrifuged at 14,000 xg for 60 minutes at 4°C. The 
supernatant was heat treated for 15 minutes at 85* C to denature the less 
thermostable E. coli host proteins and then centrifuged to remove the particulate 
material (5000 xg for 10 minutes). The remaining supernatant was filtered using a 
1 0 Nalgene® (Rochester, New York) disposable filter (0.2 |im). The remaining 

supernatant contained 23.7 mg of soluble protein, or approximately 5.7% of the 
protein contained in the crude extract (417.5 mg). The supernatant can be further 
purified by using ion-exchange column chromatography, such as DEAE 
Sepharose, or by using any such purification methods known in the art. 

1 5 

The T. litoralis inorganic pyrophosphatase obtained from ER2426 
containing the pUC18/PPase construct was analyzed by SDS-polyacrylamide gel 
electrophoresis (10 - 20% SDS-PAGE) and subsequent staining of the proteins 
using a colloidal stain (ISS ProBlue; Integrated Separation Systems; Natick, 

2 0 Massachusetts); it was determined that the band at 20 - 21 kDa was the recombinant 

T. litoralis inorganic pyrophosphatase which agreed with the apparent molecular 
weight of the native T. litoralis inorganic pyrophosphatase. The T. litoralis 
inorganic pyrophosphatase expressed from pUC18 and obtained through the above 
method was found to produce approximately 25-fold the amount of enzyme (mg) 
2 5 per gram of cells when compared to the expression of the same enzyme when 

purified from Thermococcus litoralis strain NS-C (DSM NO. 5473) as in Example 
I. 
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EXAMPLE V 

Extreme Thermostability of the T. litoralis Inorganic 
Pyrophosphatase 



Temperature dependence of the PPase reaction is shown in Figure 9. The 
3 5 maximum activity of the 7\ litoralis inorganic pyrophosphatase, purified by the 

method described in Example I, was determined to be at 75 °C. Inorganic 
pyrophosphatase activity was measured using the modified method of Taussky and 
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Schorr, supra, the disclosure of which is hereby incorporated by reference herein. 
The inorganic pyrophosphatase (one NEB unit) was incubated at 25°C to 100°C 
for 10 minutes in a 100 \i\ reaction containing 30 mM Tris pH 9.0, L5 mM MgCl2 
and 1.5 mM sodium pyrophosphate. The enzyme is then cooled quickly and 
complexed with 0.9 ml of Taussky-Schorr reagent ( 5.0 g ferrous sulfate, 10 ml 
10% ammonium molybdate in 10 N sulfuric acid, in a final solution volume of 100 
ml.) After 10 minutes, an absorbance reading is taken at 660 nm. Two samples 
were tested at each temperature. 

The extreme thermostability of the T. litoralis inorganic pyrophosphatase 
purified as described in Example I was determined by the following method. 
Purified T. litoralis (approximately 1 NEB unit/10 |al) was preincubated in the 
absence of Mg 2+ at 100°C in 50 mM Tris-HCl pH 9.0 buffer (total volume = 150 
111). A 10 \i\ aliquots were removed at time zero and at 1, 2, 3, and 4 hours. Two 
reaction samples were measured at each time point. The inorganic pyrophosphatase 
activity was measured using the modified Taussky and Schorr, method as 
described above, supra.. 100% of the original pyrophosphatase activity was 
detected at 75°C after incubation at 100°C for 4 hours. 

The heat stability of the T. litoralis inorganic pyrophosphatase is far more 
stable than the other commercially available inorganic pyrophosphatases. 
Commercially available PPases are heat resistant up to the following temperatures: 
Escherichia coli (80°C) (Sigma; St. Louis, Missouri), Baker's yeast (50°C) 
(Sigma, St. Louis, Missouri; Boehringer Mannheim, Indianapolis, Indiana; 
Amersham Corp, Arlington Heights, Illinois), Bacillus stearothermophilus (80°C) 
(Sigma; St. Louis, Missouri). Heat resistance is defined as the temperature at which 
the activity of the enzyme decreases not more than 5% within 10 minutes. See 
Schreier, E. and Hohne, W. E., supra, the disclosure of which is hereby 
incorporated by reference herein. 

The recombinant T. litoralis inorganic pyrophosphatase has the same 
extreme thermostability of the native enzyme. The native and recombinant T. 
litoralis PPases all maintain 100% of their original pyrophosphatase activity at 75 e C 
after incubation at 100°C for 4 hours. Moreover, the temperature dependence of the 
PPase activity was redetermined as described above, using both the native and 
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recombinant T. litoralis inorganic pyrophosphatases (from pAII17/PPase 
construct). The optimum temperature range for the T. litoralis PPase was analyzed 
more closely between 70°C and 75°C; it was determined that the optimum 
temperature was more specifically 72°C and not 75 C C as previously determined. 
5 See Figure 10. 

EXAMPLE VI 

Characterization of T. litoralis Inorganic Pyrophosphatase 

dH optimum 



The activity of the T. litoralis inorganic pyrophosphatase was assayed 
using the modified Taussky- Schorr method as previously described, supra.. 

1 5 Various pH values were tested using both Tris-hydrocholoride (pH 7.5 - 9.5) and 

glycine-NaOH (pH 9.0 - 10.5) buffers, all at a final reaction concentration of 30 
mM. When the T. litoralis PPase (1.3 ng) was assayed for 10 minutes at 72°C in a 
reaction mixture (100 containing 1.5 mM MgCl2 and 1.5 mM PPj, the enzyme 
exhibited a pH optimum of pH 9.5 in both Tris and glycine buffers. The glycine 

2 0 system provided better activity at a pH optimum of 9.5. See Figure 1 1 . 



Molecular mass 



The molecular mass of the T. litoralis inorganic pyrophosphatase was 

2 5 estimated by gel filtration in a Superose 12 HR 10/30 (Pharmacia; Piscataway, 

New Jersey) (10 x 300 mm). The column was equilibrated with 50 mM Tris-HCl 
pH 7.5 containing 100 mM NaCl and run at a flow rate of 0.5 ml/min. The 
Superose 12 HR 10/30 had been previously calibrated with different standard 
proteins: Catalase (232 kDa), Aldolase (158 kDa), Bovine Serum Albumin (67 

3 0 KDa), Ovalbumin (43 kDa), Chymotiypsinogen (25 kDa), and Ribonuclease A 

(13.7 kDa) (all from Pharmacia; Piscataway, New Jersey). From the plot of log 
(molecular mass) versus the distribution coefficient (K a vX a value of 125 kDa was 
calculated for the molecular mass of 7. litoralis inorganic pyrophosphatase. See 
Figure 12. Since the apparent molecular mass of the monomer was determined to 
3 5 be 20 to 21 kDa on an SDS-PAGE and 19,666 Da by the DNA sequence, the 

enzyme is probably homohexameric. The homohexameric structure of the 
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Thermococcus litoralis inorganic pyrophosphatase agrees with other known 
cytoplasmic PPases from eubacterial and archaeal sources which tend to be 
tetrameric or hexameric complexes with single subunits of 19 to 23 kDa. 

Substrate specificity 

The T. litoralis inorganic pyrophosphatase was highly specific for PPi. 
Release of Pi from PPPi, PPPPi, and dADP were 1.8%, 1.8% and 0.3% of that of 
PPi respectively. Hydrolysis of other substrates (pNPP, dATP, dAMP, dCTP, 
dGTP, dTTP) tested were negligible. The above substrates were tested at 
concentrations of 1.5 mM in a 100 p.1 reaction containing 30 mM Tris pH 9.0, 1.5 
mM MgCl2 and 1.3 ng PPase. The pyrophosphatase activity was detected after 10 
minutes at 72°C using the modified Taussky-Schorr assay method as previously 
described, supra. 

TABLE I 

Substrate specificity of inorganic pyrophosphatase from T. litoralis 



Cation Percentage Activity 



PPi 100% 

PPPi 1.8% 

PPPPi 1.8% 

pNPP 0% 

dATP 0% 

dADP 0.3% 

dAMP 0% 

dCTP 0% 

dGTP 0% 

dTTP 0% 



Divalent cation specificity 

The effect of various divalent cations on the catalytic activity of the enzyme 
was investigated. Magnesium provided the optimum activity; no activity was 
observed in the absence of divalent cations. Manganese, cobalt, zinc and nickel 
could be substituted, resulting in 4.6%, 2.8%, 0.9% and 0.6%, respectively, of the 
activity observed with magnesium. No activity was observed with calcium, copper 
or iron. The various divalent cations were tested at concentrations of 1.5 mM in a 
100 reaction containing 30 mM Tris pH 9.0, 1.5 mM sodium pyrophosphate and 
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L3 ng PPase. The pyrophosphatase activity was detected after 10 minutes at 72°C 
using the modified Taussky-Schorr assay method as previously described, supra. 

TABLE II 

5 

Effect of divalent cations on T. litoralis inorganic pyrophosphatase 



Cation Percentage Activity 

None 0% 

Mg2+ 100% 

Mn2+ 4.6% 

Co 2 + 2.8% 

Zn 2 + 0.9% 

Ni2+ 0.6% 

Ca 2 + 0% 

Cu 2 + 0% 

Fe 2 + 0% 



Effect Of Pyrophosphate and Magnesium Concentrations on enzvme 

1 U activity 

Except at very high Mg 2 + concentrations, maximum enzyme activity 
occurred when the Mg 2+ to pyrophosphate ratio was 1:1. In the region of excess 
PPi, inhibition was observed; excess Mg 2+ was also inhibitory. 

15 

The inorganic pyrophosphatase from T. litoralis did not appear to follow 
simple Michaelis-Menten kinetics when comparing reaction rates at different PPi 
and magnesium concentrations. However, if the initial velocities for low substrate 
concentrations of PPi and Mg 2+ and a Mg 2+ /PPj ratio of one is used then the 

2 0 corresponding Lineweaver-Burke plot yields a straight line. The apparent K m value 

for the enzyme with respect to either Mg 2 + or PPi was calculated to be 1 .0 mM. 

Comparison of amino acid alignment to other known PPases 

2 5 The complete amino acid sequence of the T. litoralis inorganic 

pyrophosphatase was compared to that of other known PPases. T. litoralis PPase 
exhibits similarity to the following known PPases: T. acidophilum (49%), 
thermophilic eubacterium PS-3 (49%), S. acidocaldarius (46%), A. thaliana (46%), 
E. coli (41%), T. thermophilus (39%), S. cerevisiae (27%). The similarity 

3 0 percentage is based on the percentage of identical residues with respect to the 
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shorter of each sequence pair. All sequences were extracted from GENB ANK, 
EMBL or SWISSPROT except for S. acidocaldarius. See Meyer, et. al. (1995), 
supra. 

5 PROSITE document PDOC00325 lists an inorganic pyrophosphatase 

signature consensus pattern that is shared by ALL known sequences of this class as 
of August 25, 1995: 

D - [SGN] - D - [PE] - [LIVM] - D - [LIVMGC] 
1 0 residue: 1 2 3 4 5 6 7 

It is believed that the three aspartic acid residues (D, underlined) bind 
divalent metal cations. The inorganic pyrophosphatase from Thermococcus litoralis 
does not completely fit this pattern: instead the amino acid sequence for 7. litoralis 

1 5 PPase in this region is: 

D-D-D-D-P-F-D-I 
residue: -1 1 2 3 4 5 6 7 

2 0 There are three changes to note when comparing the T. litoralis amino acid 

sequence to the inorganic pyrophosphatase signature concensus pattern found in 
PROSITE. First, there is an additional aspartic acid (D) preceding the concensus 
pattern which is not found in any of the other known PPase. Second, the amino 
acid residue in position 2 is an aspartic acid (D) and not serine (S), glycine (G) or 
2 5 asparagine (N). Third, the amino acid residue in position 5 is a phenylalanine (F) 

and not leucine (L), isoleucine (I), valine (V) or methionine (M). While all known 
pyrophosphatases have 3 aspartic acid (D) residues in this signature region to bind 
divalent cations, the inorganic pyrophosphatase from 7\ litoralis has an additional 2 
aspartic acids to give a total of 5 aspartic acid residues in this signature region. 
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EXAMPLE VII 

Performance of T. litoralis Inorganic Pyrophosphatase in DNA 

Sequencing 

The performance of the thermostable inorganic pyrophosphatase of the 
present invention was examined in thermal cycle DNA sequencing reactions. The 
PPase from T. litoralis strain NS-C (DSM No. 5473), prepared as described in 
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example I, was used in standard Applied BioSystems Division, Perkin-Elmer 
Corporation (Foster City, California) Taq Dye Primer Cycle Sequencing. Two 
sequencing reactions were performed with and without the addition of PPase (1 
NEB unit). Both reactions were sequenced according to standard Applied 
BioSystems Division, Perkin-Elmer Corporation (Foster City, California) Taq Dye 
Primer Cycle Sequencing Kit protocols using M13mpl8 templates and primer 
-21M13. The only change to the recommended protocol was that for the PPase 
reaction, one NEB (New England Biolabs, Inc.; Beverly, Massachusetts) unit of 
PPase was added to the diluted Taq reaction mix before the dye primer reactions 
were prepared. The two sequencing reactions were cycled under the following 
conditions: preheat cycler to 95°C for 5 minutes, 15 cycles of 30 seconds 
denaturing at 95°C, 30 seconds annealing at 55°C and 1 minute extension at 70X. 
Extension products were purified using Centri-Sep spin columns (Princeton 
Separations, Inc.; Adelphia, New Jersey). Samples were run on an Applied 
BioSystems Division, Perkin-Elmer Corporation (Foster City, California) 373A 
DNA Sequencer. The DNA sequence traces show that the addition of T. litoralis 
PPase solves some sequencing artifacts present in the identical reaction performed 
without inclusion of the T. litoralis pyrophosphatase. See FIG. 13. The M13mpl8 
sequence without PPase shows dropout peaks at position 164 and position 310. 
When PPase is added to the reaction, these two peaks are resolved. 

The recombinant T. litoralis PPase purified from E. coli strain 
ER2497/pLysS containing the pAII17/PPase construct, prepared as described in 
example IV, was also examined in standard Applied BioSystems Division, Perkin- 
Elmer Corporation (Foster City, California) Taq Dye Primer Cycle Sequencing. 
Sequencing reactions were set up exactly as described above. The DNA sequence 
traces show that the addition of T. litoralis PPase solves the problem of specific 
fragments disappearing on DNA sequencing gels. See Figure 14. The ml3mpl8 
sequence without PPase shows missing G and A peaks at position 316 and 317 
respectively. When the same ml3mpl8 template is sequence with the addition of 
PPase (2 NEB units), the peaks are resolved, here shown at position 310 and 311. 
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Missing sequencing peaks are thought to be due to pyrophosphorolysis, 
wherein the presence of excess inorganic pyrophosphate causes the hydrolysis of 
the 3 -terminal base. See generally, Deutsher et al., J. Biol. Chem. (1969) 
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244:3019 and Kornberg, DNA Replication , pp. 125-126, published by Freeman & 
Co., SF, the disclosure of which are hereby incorporated by reference herein. 
Inclusion of T. litoralis inorganic pyrophosphatase in a thermal cycle DNA 
sequencing reaction reduces the level of pyrophosphate and improves the 
uniformity of band intensity of nearby bands, thus improving reading of DNA 
sequencing gels. 



The weak or missing fragments due to pyrophosphorolysis appears to be 
DNA sequence dependent, occurring at very specific sites. For example, the 
sequence dependence of the missing G peak in an ml3mpl8 template using 
Applied BioSystems Division, Perkin-Elmer Corporation (Foster City, California) 
Taq Dye Primer Cycle Sequencing is 5'-AGTGAG-3\ where the underlined "G" 
disappears due to pyrophosphorolysis, but returns when T. litoralis PPase is 
added to the reaction. 

SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: LENNOX, Tricia L. 

SLATKO, Barton E. 
SEARS, Lauren E. 

(ii) TITLE OF INVENTION: PURIFIED THERMOSTABLE INORGANIC 
PYROPHOSPHATASE OBTAINABLE FROM THERMOCOCCUS 
LITORALIS 

(iii) NUMBER OF SEQUENCES: 28 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: NEW ENGLAND BIOLABS, INC. 

(B) STREET: 32 TOZER ROAD 

(C) CITY: BEVERLY 

(D) STATE: MASSACHUSETTS 

(E) COUNTRY: USA 

(F) ZIP: 01915 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 
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(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/329,721 

(B) FILING DATE: 25-OCT-1994 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: WILLIAMS, GREGORY D. 

(B) REGISTRATION NUMBER: 30 901 

(C) REFERENCE /DOCKET NUMBER: NEB-1 05-PCT 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (508) 927-5054 

(B) TELEFAX: (508) 927-1705 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 222 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: DNA (genomic) 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

TTAGAGCCTG GACCGGAAGT ACCGGAAGTT GTTTACGCCT TAATAGAGAT TCCAAAGGGG 60 

AGCAGAAACA AGTATGAGCT TGACAAAAAG ACCGGTCTTA TAAAGCTCGA TAGAGTTCTT 12 0 

TACAGNCCAT TCCACTACCC GGTCGACTAT GGAATCATCC CACAAACATG GTACGATGAT 18 0 

GACGACCCGT TTGACATCAT GGTCATAATG AGGGAGCCGA CC 222 



(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Leu Glu Pro Gly Pro Glu Val Pro Glu Val Val Tyr Ala Leu lie Glu 
15 10 15 

lie Pro Lys Gly Ser Arg Asn Lys Tyr Glu Leu Asp Lys Lys Thr Gly 
20 25 30 

Leu lie Lys Leu Asp Arg Val Leu Tyr Ser Pro Phe His Tyr Pro Val 
35 40 45 
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Asp Tyr Gly lie lie Pro Gin Thr Trp Tyr Asp Asp Asp Asp Pro Phe 
50 55 60 

Asp lie Met Val lie Met Arg Glu Pro Thr 
65 70 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

Thr Tyr Thr Thr Arg Gin lie Gly Ala Lys Asn Thr Leu Glu Tyr Lys 
1 5 " 10 15 

Val Tyr lie Glu Lys Asp Gly Lys Pro Val Ser Ala Phe His Asp He 
20 25 30 

Pro Leu Tyr Ala Asp Lys Glu Asn Asn He Phe Asn Met Val Val Glu 
35 40 45 

lie Pro Arg Trp Thr Asn Ala Lys Leu Glu lie Thr Lys Glu Glu Thr 
50 55 60 

Leu Asn Pro He He Gin Asp Thr Lys Lys Gly Lys Leu Arg Phe Val 
65 70 75 80 

Arg Asn Cys Phe Pro His His Gly Tyr He His Asn Tyr Gly Ala Phe 
85 90 95 

Pro Gin Thr Trp Glu Asp Pro Asn Val Ser His Pro Glu Thr Lys Ala 
100 105 HO 

Val Gly Asp Asn Asp Pro He Asp Val Leu Glu He Gly Glu Thr He 
115 120 125 

Ala Tyr Thr Gly Gin Val Lys Gin Val Lys Ala Leu Gly He Met Ala 
130 135 140 

Leu Leu Asp Glu Gly Glu Thr Asp Trp Lys Val He Ala He Asp lie 
145 150 155 160 

Asn Asp Pro Leu Ala Pro Lys Leu Asn Asp He Glu Asp Val Glu Lys 
165 170 175 

Tvr Phe Pro Gly Leu Leu Arg Ala Thr Asn Glu Trp Phe Arg He Tyr 
Y 180 185 190 

Lys He Pro Asp Gly Lys Pro Glu Asn Gin Phe Ala Phe Ser Gly Glu 
195 200 205 
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Ala Lys Asn Lys Lys 
210 



Ser Trp Lys Gin Leu 
225 



Asp Leu Thr Asn Val 
245 



Ala Ser Asp Ala lie 
260 



Asp Lys Ser lie Asp 

275 



Tyr Ala Leu Asp lie lie 
215 



lie Ala Gly Lys Ser Ser 
230 235 

Thr Leu Pro Asp Thr Pro 
250 

Pro Pro Ala Ser Leu Lys 
265 

Lys Trp Phe Phe lie Ser 
280 



Lys Glu Thr His Asp 
220 

Asp Ser Lys Gly lie 
240 

Thr Tyr Ser Lys Ala 
255 

Ala Asp Ala Pro lie 
270 



Gly Ser Val 
285 



(2) INFORMATION FOR SEQ ID NO : 4 : 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 63 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Met Ala Glu lie Lys Asp Glu Gly Ser Ala Lys Gly Tyr Ala Phe Pro 
15 10 15 

Leu Arg Asn Pro Asn Val Thr Leu Asn Glu Arg Asn Phe Ala Ala Phe 
20 25 30 

Thr His Arg Ser Ala Ala Ala His Pro Trp His Asp Leu Glu lie Gly 
35 40 45 

Pro Glu Ala Pro Thr Val Phe Asn Cys Ala Val Glu lie Ser Lys Gly 
50 55 60 

Gly Lys Val Lys Tyr Glu Leu Asp Lys Asn Ser Gly Leu lie Lys Val 
65 70 75 80 

Asp Arg Val Leu Tyr Ser Ser lie Val Tyr Pro His Asn Tyr Gly Phe 
85 90 95 

lie Pro Arg Thr lie Cys Glu Asp Ser Asp Pro Met Asp Val Leu Val 
100 105 110 

Leu Met Gin Glu Pro Val Leu Thr Gly Ser Phe Leu Arg Ala Arg Ala 
115 120 125 

lie Gly Leu Met Pro Met lie Asp Gin Gly Glu Lys Asp Asp Lys lie 
130 135 140 



lie Ala Val Cys Ala Asp Asp Pro Glu Phe Arg His Tyr Arg Asp lie 
145 150 155 160 
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Lys Glu Leu Pro 



Asp Tyr Lys Lys 
18 0 

Pro Ala Gin Ala 
195 

Glu Leu Thr Ser 
210 

Phe Pro Phe Leu 
225 

Thr Thr Cys Met 



Met Leu lie Arg 

260 



Pro His Arg Leu 
165 

Asn Glu Asn Lys 



Ala lie Asp Ala 
200 

Lys Leu Ala Cys 
215 

Pro Val Cys Leu 
230 

Leu Asp Lys lie 
245 

Lys His Cys 



Ala Glu lie Arg 
170 

Lys Val Asp Val 
185 

lie Lys Asp Ser 



Asn Ala Asn Glu 

220 

Asp lie Thr Glu 
235 

Ser lie Gly Ala 
250 



Arg Phe Phe Glu 
17 5 

Glu Ala Phe Leu 
190 

Met Asp Leu Tyr 

205 

Glu Thr Ser Pro 



Ala Ala Phe Tyr 
240 

Phe Asn Phe Val 
255 



(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 175 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

Ser Leu Leu Asn Val Pro Ala Gly Lys Asp Leu Pro Glu Asp lie Tyr 
15 10 15 

Val Val lie Glu lie Pro Ala Asn Ala Asp Pro lie Lys Tyr Glu lie 
20 25 30 

Asp Lys Glu Ser Gly Ala Leu Phe Val Asp Arg Phe Met Ser Thr Ala 
35 40 45 

Met Phe Tyr Pro Cys Asn Tyr Gly Tyr lie Asn His Thr Leu Ser Leu 
50 ~ 55 60 

Asp Gly Asp Pro Val Asp Val Leu Val Pro Thr Pro Tyr Pro Leu Gin 
65 ' 70 75 80 

Pro Gly Ser Val lie Arg Cys Arg Pro Val Gly Val Leu Lys Met Thr 
8 5 90 95 

Asp Glu Ala Glv Glu Asp Ala Lys Leu Val Ala Val Pro His Ser Lys 
100 105 HO 

Leu Ser Lys Glu Tyr Asp His lie Lys Asp Val Asn Asp Leu Pro Glu 
115 120 125 
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Leu Leu Lys Ala Gin lie Ala His 
130 135 

Glu Lys Gly Lys Trp Val Lys Val 
145 150 

Ala Lys Ala Glu lie Val Ala Ser 
165 



Phe Phe Glu His Tyr Lys Asp Leu 
140 

Glu Gly Trp Glu Asn Ala Glu Ala 

155 160 

Phe Glu Arg Ala Lys Asn Lys 
170 175 



(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 164 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS : unknown 
(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Ala Phe Glu Asn Lys lie Val Glu Ala Phe lie Glu lie Pro Thr Gly 
15 10 15 

Ser Gin Asn Lys Tyr Glu Phe Asp Lys Glu Arg Gly lie Phe Lys Leu 
20 25 30 

Asp Arg Val Leu Tyr Ser Pro Met Phe Tyr Pro Ala Glu Tyr Gly Tyr 
35 40 45 

Leu Gin Asn Thr Leu Ala Leu Asp Gly Asp Pro Leu Asp lie Leu Val 
50 55 60 

lie Thr Thr Asn Pro Pro Phe Pro Gly Cys Val lie Asp Thr Arg Val 
65 70 ^75 ~ 80 

lie Gly Tyr Leu Asn Met Val Asp Ser Gly Glu Glu Asp Ala Lys Leu 
85 90 95 

lie Gly Val Pro Val Glu Asp Pro Arg Phe Asp Glu Val Arg Ser lie 
100 105 110 

Glu Asp Leu Pro Gin His Lys Leu Lys Glu lie Ala His Phe Phe Glu 
115 120 125 

Arg Tyr Lys Asp Leu Gin Gly Lys Arg Thr Glu lie Gly Thr Trp Glu 
130 135 140 

Gly Pro Glu Ala Ala Ala Lys Leu lie Asp Glu Cys lie Ala Arg Tyr 
145 150 155 160 

Asn Glu Gin Lys 



(2) INFORMATION FOR SEQ ID NO : 7 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 179 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

Met Glu Ser Phe Tyr His Ser Val Pro Val Gly Pro Lys Pro Pro Glu 
15 10 15 

Glu Val Tyr Val lie Val Glu lie Pro Arg Gly Ser Arg Val Lys Tyr 
20 25 30 

Glu lie Ala Lys Asp Phe Pro Gly Met Leu Val Asp Arg Val Leu Tyr 
35 40 45 

Ser Ser Val Val Tyr Pro Val Asp Tyr Gly Leu lie Pro Arg Thr Leu 
50 ^ 55 60 

Tyr Tyr Asp Gly Asp Pro Met Asp Val Met Val Leu lie Ser Gin Pro 
65 " ' - 70 75 80 

Thr Phe Pro Glv Ala lie Met Lys Val Arg Pro lie Gly Met Met Lys 
85 90 95 

Met Val Asp Gin Gly Glu Thr Asp Asn Lys lie Leu Ala Val Phe Asp 
100 105 HO 

Lys Asp Pro Asn Val Ser Tyr lie Lys Asp Leu Lys Asp Val Asn Ala 
115 120 125 

His Leu Leu Asp Glu lie Ala Asn Phe Phe Ser Thr Tyr Lys lie Leu 
130 135 140 

Glu Lys Lys Glu Thr Lys Val Leu Gly Trp Glu Gly Lys Glu Ala Ala 
145 ~ 150 155 160 

Leu Lys Glu lie Glu Val Ser He Lys Met Tyr Glu Glu Lys Tyr Gly 
165 1*70 175 

Lys Lys Asn 



(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

Met Asn Pro Phe His Asp Leu Glu Pro Gly Pro Glu Val Pro Glu Val 
15 10 15 

Val Tyr Ala Leu He Glu He Pro Lys Gly Ser Arg Asn Lys Tyr Glu 
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Leu Asp Lys Lys Thr Gly Leu lie Lys Leu Asp Arg Val Leu Tyr Ser 
35 40 45 

Pro Phe His Tyr Pro Val Asp Tyr Gly lie lie Pro Gin Thr Trp Tyr 
50 55 60 

Asp Asp Asp Asp Pro Phe Asp lie Met Val lie Met Arg Glu Pro Thr 
65 70 75 80 

Tyr Pro Gly Val Leu lie Glu Ala Arg Pro lie Gly Gly Phe 
85 90 

(2) INFORMATION FOR SEQ ID NO : 9 : 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 531 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

ATGAATCCAT TCCACGATTT AGAGCCTGGA CCGGAAGTAC CGGAAGTTGT TTACGCCTTA 60 

ATAGAGATTC CAAAGGGGAG CAGAAACAAG TATGAGCTTG ACAAAAAGAC CGGCCTTATA 12 0 

AAGCTCGATA GAGTTCTTTA CAGCCCATTC CACTACCCGG TCGACTATGG AATCATCCCA 180 

CAAACATGGT ACGATGATGA CGACCCGTTT GACATCATGG TCATAATGAG GGAGCCAACA 2 40 

TATCCGGGAG TTCTTATTGA GGCAAGACCA ATAGGCCTCT TCAAGATGAT AGACAGCGGC 300 

GACAAGGACT ACAAGGTATT GGCAGTTCCA GTGGAAGATC CCTACTTTAA TGACTGGAAG 3 60 

GACATAAGCG ACGTTCCGAA GGCTTTCCTT GACGAGATTG CGCACTTCTT CCAGAGATAC 42 0 

AAAGAGCTCC AAGGTAAGGA AATCATTGTT GAGGGCTGGG AAAAC GC AG A GAAGGCAAAG 4 80 

CAAGAAATAC TTAGGGCAAT AG A AC T T T AC AAGGAGAAAT TCAAGAAGTG A 531 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Asn Pro Phe His Asp Leu Glu Pro Gly Pro Glu Val Pro Glu Val 
15 10 15 



Val Tyr Ala Leu lie Glu lie Pro Lys Gly Ser Arg Asn Lys Tyr Glu 
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20 25 30 

Leu Asp Lys Lys Thr Gly Leu lie Lys Leu Asp Arg Val Leu Tyr Ser 
35 ~ 40 " 45 

Pro Phe His Tyr Pro Val Asp Tyr Gly lie lie Pro Gin Thr Trp Tyr 
50 55 60 

Asp Asp Asp Asp Pro Phe Asp lie Met Val lie Met Arg Glu Pro Thr 
65 70 75 80 

Tyr Pro Gly Val Leu lie Glu Ala Arg Pro lie Gly Leu Phe Lys Met 
85 90 95 

lie Asp Ser Gly Asp Lys Asp Tyr Lys Val Leu Ala Val Pro Val Glu 
100 105 110 

Asp Pro Tyr Phe Asn Asp Trp Lys Asp lie Ser Asp Val Pro Lys Ala 
115 120 125 

Phe Leu Asp Glu lie Ala His Phe Phe Gin Arg Tyr Lys Glu Leu Gin 
130 135 140 

Gly Lys Glu lie lie Val Glu Gly Trp Glu Asn Ala Glu Lys Ala Lys 
145 " 150 ~ " 155 160 

Gin Glu lie Leu Arg Ala lie Glu Leu Tyr Lys Glu Lys Phe Lys Lys 
165 170 175 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 
<D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 1 : 

Met Asn Pro Phe His Asp Leu Glu Pro Gly Pro Glu Val Pro Glu Val 
15 10 15 

Val Tyr Ala Leu lie Glu lie Pro Lys Gly Ser Arg Asn Lys Tyr Glu 
20 25 30 

Leu Asp Lys Lys Thr 
35 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Arg Glu Pro Thr Tyr Pro Gly Val Leu He Glu Ala Arg Pro He Gly 
1 5 10 15 

Gly Phe 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : unknown 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME /KEY : 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /note- "Y - C or T" 
<ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /note= "N = C, T, A or G" 
(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 12 

(D) OTHER INFORMATION: /note* " Y - C or T" 
(ix) FEATURE: 

(A) NAME /KEY : 

(B) LOCATION: 15 

(D) OTHER INFORMATION: /note- "Y - C or T" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
ATGAAYCCNT TYCAYGA 



(2) INFORMATION FOR SEQ ID NO : 1 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME /KEY: 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /note- "H = A, C or T" 
(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /note= "R - A or G" 
(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 9 
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<D) OTHER INFORMATION: /note= "H = A, C or T" 
(ix) FEATURE: 

(A) NAME /KEY : 

(B) LOCATION: 12 

(D) OTHER INFORMATION: /note" "N = C, T, A or G" 
(ix) FEATURE: 

(A) NAME /KEY : 

(B) LOCATION: 15 

(D) OTHER INFORMATION: /note- "R = A or G" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
ATHGARATHC CNAARGG 



(2) INFORMATION FOR SEQ ID NO: 15: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /note= "N = C, T, A or G" 
(ix) FEATURE: 

(A) NAME /KEY: 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /note= "N = C, T, A or G" 
(ix) FEATURE: 

(A) NAME /KEY : 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /note= "N = C, T, A or G" 
(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 12 
(D) OTHER INFORMATION: /note= 

(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 15 
(D) OTHER INFORMATION: 

(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 18 
(D) OTHER INFORMATION: /note= 

(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 21 

(D) OTHER INFORMATION: /note- "Y ■= C or T 1 



A or G" 



/note= "N = C, T, A or G" 



"N = C, T, A or G" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 



AGNACNCCNG GRTANGTNGG YCC 



(2) INFORMATION FOR SEQ ID NO: 16: 



WO 96/12798 



PCT/US95/13662 



(o\ 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 17 

(D) OTHER INFORMATION: /note= "Y * C or T" 
<ix) FEATURE: 

(A) NAME /KEY: 

(B) LOCATION: 2 0 

(D) OTHER INFORMATION: /note- "N - deoxyinosine" 
(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 23 

(D) OTHER INFORMATION: /note* "Y - C or T" 
(ix) FEATURE: 

(A) NAME / KEY : 

(B) LOCATION: 2 6 

(D) OTHER INFORMATION: /note- "Y - C or T" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
CTGAATTCCA TATGAAYCCN TYCAYGA 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /note* "N * deoxyinosine" 
(ix) FEATURE: 

(A) NAME /KEY : 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /note* "N * deoxyinosine" 
(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /note- "N * deoxyinosine" 
(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 12 

(D) OTHER INFORMATION: /note* "R = A or G" 
(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 15 

(D) OTHER INFORMATION: /note* "N » deoxyinosine" 
(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 18 
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(D) OTHER INFORMATION: /note- "N - deoxyinosine" 
(ix) FEATURE: 

(A) NAME / KEY : 

(B) LOCATION: 21 

(D) OTHER INFORMATION: /note- f, Y - C or T" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
AGNACNCCNG GRTANGTNGG YCC 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNES S : unknown 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

GGTACGATGA TGACGACCCG TTTG 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNES S : unknown 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

GGATGATTCC ATAGTCGACC GGG 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

GTTGAGGGCT GGTCAAAACG C 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 
<ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

GCGTTTTGAC CAGCCCTCAA C 21 



(2) INFORMATION FOR SEQ ID NO:22: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 
<D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
GAGATTGTGC ACTTCTTCCA GAGA 2 4 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

TCTCTGGAAG AAGTGCACAA TCTC 2 4 



(2) INFORMATION FOR SEQ ID NO : 2 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

TGAATCCATT CCACGATTTA GAGCCT 2 6 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 
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(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

GGGAATTCCA TATGAATCCA TTCCACGATT TAGAGCCT 38 



(2) INFORMATION FOR SEQ ID NO : 2 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D ) TOPOLOGY : unknown 
(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 

CCGGAATTCA TGAATCCATT CCACGATTTA GAGCCT 3 6 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

CGCGGATCCT CACTTCTTGA ATTTCTCCTT GTAAAG 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 250 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

GAATTGTGAG CGCTCACAAT TCTAGGATGT TAATTGCGCC GACATCATAA CGGTTCTGGC 

AAATATTCTG AAATGAGCTG TTGACAATTA ATCATCGGCT CGTATAATGT GTGGAATTGT 

GAGCGGATAA CAATTTCACA CAGGAAACAG ACCATGGTGA ATTCTAGAGC TCGAGGATCC 

GCGGTACCCG GGCATGCATT CGAAGCTTCC TTAAGCGGCC GTCGACCGAT GCCCTTGAGG 



CCTTCAACCA 



250 
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WHAT IS CLAIMED IS: 

1 . A purified thermostable enzyme obtainable from Thermococcus litoralis 
which catalyzes the hydrolysis of inorganic pyrophosphatase, wherein said 
enzyme has a molecular weight of 20-21 kDa when compared with protein 
standards including trypsin inhibitor (soybean) - 21.5 kDa. 

2. The thermostable enzyme of claim 1, wherein the enzyme is active after four 
hours incubation at 100°C and has 100% of its activity at 72°C. 

3 . The thermostable enzyme of claim 1, wherein said enzyme is isolated from 
a recombinant organism transformed with a vector that codes for the 
expression of the enzyme. 

4. Isolated DNA which codes for the thermostable enzyme of claim 1. 

5 . The isolated DNA of claim 4, wherein the DNA includes the nucleotide 
sequence (SEQ ID NO:9). 

6. A cloning vector comprising the isolated DNA of claim 1. 

7 . A host cell transformed by the vector of claim 6. 

8 . A method for producing a recombinant thermostable DNA inorganic 
pyrophosphatase from Thermococcus litoralis comprising culturing a host 
cell transformed with the vector of claim 6 under conditions suitable for the 
expression of the DNA pyrophosphatase. 
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PURITY OF PREPARATIONS OF T. LITORALIS 
INORGANIC PYROPHOSPHATASE 



FIG. I 




PURIFICATION OF RECOMBINANT T. LITORALIS 
INORGANIC PYROPHOSPHATASE 
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IPCR of the Inorganic Pyrophosphatase Gene from T. litoralis Genomic DNA 
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Effect of Temperature on the Activity of T. litoralis PPase 
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Effect of Temperature on the Activity of the Native and Recombinant 
T. litoralis Inorganic Pyrophosphatase 
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Effect of pH on the Activity of the Recombinant T. lit oralis 
Inorganic Pyrophosphatase 



■ Tris-HCI buffer 
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Molecular Weight Determination of T. litoralis PPase 
on Superose 12 HR 10/30 
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Partial trace data from an ml3mpl8 template sequenced with 
AmpliTaq DNA polymerase (no T. litoralis PPase) 



TCCATGCC TGCAGGT C G AC TC TAG A G GATCCCCGG GTACC A GNCTCGAA 
10 20 30 40 50 




AAGC AT AAA G TG TA AAGCC TG G G G TGC C TA ATG AG TG AGCTA ACTCAC 
130 140 150 160 170 
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Partial trace data from an ml3mpl8 template sequenced with 
AmpliTaq DNA polymerase (no T. litoralis PPase) 



TCGTA A TCATG 6TC ATAGC TG TTTCC TG TG TG AAAT TGT T A TCC G C 
60 70 80 90 100 




ATT A ATTGCGTTGCGCTC ACTGCC CGCTTTCC AGTCGG G AAACCTG 
180 190 200 210 220 




T TTCT TTT CACCA GT NAG AC G GGCA ACAGCTG ATTG CCC TT C ACC 
300 310 320 330 




FIG. I3A-2 
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Partial trace data from an ml3mpl8 template sequenced with AmpliTaq 
DNA polymerase in the presense of T. litoralis inorganic pyrophosphatase 

GC TTGCATGCCTGCAGGTC G ACTCT AGAG GATCCCCGG GTACC AGACTCG 
10 20 30 40 50 




CGGCCA ACG CGCGGGGAGAGGCGGTTTGCGT ATTG G GCGCCAGG G TG 
250 260 270 280 290 




FIG. I3B- I 
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Partial trace data from an m!3mpl8 template sequenced with AmpliTaq 



DNA polymerase in the presense of T. litoralis inorganic pyrophosphatase 




TCACAT TA ATTG C G TTG CGCTC AC TG CCC GC TTTCC AG TCGGG AAA 
180 190 200 210 




G TTTTTCTTTTCACCAGTG AG AC G G GCA AC A GCTG ATT GCCCTT C 
300 310 320 330 




FIG. I3B-2 
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Use of Recombinant T. li Inorganic Pyrophosphatase in 

DNA Sequencing 



800- 


CAC C AGT NNGAC G GGC 
310 320 


3669 3868 



Control - ml3mpl8, DyePrimer chemistry (-21mer), No PPase 

FIG. I4A 




3552 3751 



•f PPase - ml3mpl8, DyePrimer chemistry (-21mer) 
- 2 NEB units PPase (recombinant) 



FIG. I4B 
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Expression plasmid pAGR3: 5910 bp. 




on 



Promoter and Cloning Site Map (SEQ ID NO:28) 



lac operator 

1 GAAUGZGAGC^aCXCACA^TTCTAGGATGTTAATTGCGCCGACATCATAA 

-35 region 

5 1 CGGTTCTGGCAAATATTCTGAAATGAGCTGTXGACAATTAATCATCGGCT 

-10 region lac operator rbs 

101 C GX&1AAIG TGT GGA AXTSTfiAGJIGGAX^A CAAT T TCACAC AGjgAAACAG 

start 

151 ACCA1GGTGAATTCTAGAGCTCGAGGATCCGCGGTACCCGGGCATGCATT 
Ncol EcoRI Xbal SacI Xhol BamHI Sad I Kpnl Smal BstBI 

201 CGAAGCTTCCTTAAGCGGCCGTCGACCGATGCCCTTGAGAGCCTTCAACCA 
Hindlll Aflll EagI Sail 
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